9 18 27 36 45 54

5' ATG ATA AAC GTT CCA ACC GGA GAG GAG ACC CCA ATA CAC CTC TTT GGA GTC AAC



Met Ile Asn Val Ala Thr Gly Glu Glu Thr Pro Ile His Leu Phe Gly Val Asn

63 72 81 90 99 108

TGG TTC GGC TTT GAG ACA CCG AAC TAC GTT GTT CAC GGC CTA TGG AGT AGG AAC 



Trp Phe Gly Phe Glu Thr Pro Asn Tyr Val Val His Gly Leu Trp Ser Arg Asn

117 126 135 144 153 162

TGG GAG GAC ATC CTC CTC CAG ATC AAG AGC CTT GGC TTC AAT GCG ATA AGG CTT 



Trp Glu Asp Met Leu Leu Gln Ile Lys Ser Leu Gly Phe Asn Ala Ile Arg Leu

171 180 189 198 207 216

CCC TTC TGT ACC CAG TCA GTA AAA CCG GGG ACG ATG CCA ACG GCG ATT GAC TAC



Pro Phe Cys Thr Gln Ser Val Lys Pro Gly Thr Met Pro Thr Ala Ile Asp Tyr

225 234 243 252 261 270

GCC AAG AAC CCA GAC CTC CAG GGT CTT GAC AGC GTC CAG ATA ATG GAG AAA ATA 



Ala Lys Asn Pro Asp Leu Gln Gly Leu Asp Ser Val Gln Ile Met Glu Lys Ile

279 288 297 306 315 324

ATC AAG AAG GCT GGA GAC CTG GGC ATA TTC GTG CTC CTC GAC TAC CAC AGA ATA 



Ile Lys Lys Ala Gly Asp Leu Gly Ile Phe Val Leu Leu Asp Tyr His Arg Ile



333 342 351 360 369 378

GGA TGC AAC TTC ATA GAA CCC CTA TGG TAC ACC GAC AGC TTC TCG GAG CAG GAC



Gly Cys Asn Phe Ile Glu Pro Leu Trp Tyr Thr Asp Ser Phe Ser Glu Gln Asp

387 396 405 414 423 432

TAC ATA AAC ACC TGG GTT GAA GTC GCC CAG AGG TTC GGC AAG TAC TGG AAC GTT



Tyr Ile Asn Thr Trp Val Glu Val Ala Gln Arg Phe Gly Lys Tyr Trp Asn Val

441 450 459 468 477 486

ATC GGC CCG GAC CTG AAG AAC GAA CCC CAC AGC TCA AGC CCC GCA CCT GCC GCC 



Ile Gly Ala Asp Leu Lys Asn Glu Pro His Ser Ser Ser Pro Ala Pro Ala Ala

495 504 513 522 531 540

TAC ACT GAC GGA AGT GGG GCC ACG TGG GGA ATG GGC AAC AAC GCC ACC GAC TGG 



Tyr Thr Asp Gly Ser Gly Ala Thr Trp Gly Met Gly Asn Asn Ala Thr Asp Trp

549 558 567 576 585 594

AAC CTG GCG GCT GAG AGG ATA GGA AGG GCA ATT CTG GAG GTT GCC CCA CAA TGG 



Asn Leu Ala Ala Glu Arg Ile Gly Arg Ala Ile Leu Glu Val Ala Pro Gln Trp

603 612 621 630 639 648 

GTT ATA TTT GTT GAG GGA ACC CAG TTC ACC ACC CCC GAG ATA GAC GGT AGG TAC 



Val Ile Phe Val Glu Gly Thr Gln Phe Thr Thr Pro Glu Ile Asp Gly Arg Tyr

657 666 675 684 693 702

AAG TGG GGC CAC AAC GCC TCG TGG GGC GGA AAC CTT ATG GGT GTT AGG AAG TAC 



Lys Trp Gly His Asn Ala Trp Trp Gly Gly Asn Leu Met Gly Val Arg Lys Tyr 



711 720 729 738 747 756

CCA GTT AAC CTG CCC AGG GAC AAG GTT GTT TAC AGC CCC CAA GTT TAC GGT TCA



Pro Val Asn Leu Pro Arg Asp Lys Val Val Tyr Ser Pro Gln Val Tyr Gly Ser

765 774 783 792 801 810

GAA GTT TAC GAC CAG CCC TAC TTT GAC CCC GGT GAG GGG TTC CCC GAC AAC CTC 



Gln Val Tyr Asp Gln Pro Tyr Phe Asp Pro Gly Glu Gly Phe Pro Asp Asn Leu

819 828 837 846 855 864

CCC GAA ATA TGG TAC CAC CAC TTC GGC TAC GTA AAG CTT GAT CTC GGT TAC CCT 



Pro Glu Ile Trp Tyr His His Phe Gly Tyr Val Lys Leu Asp Leu Gly Tyr Pro

873 882 891 900 909 918

GTT GTT ATA GGT GAG TTC GGA GGC AAG TAC GGC CAT GGG GGA GAG CCS AGG GAT 



Val Val Ile Gly Glu Phe Gly Gly Lys Tyr Gly His Gly Gly Asp Pro Arg Asp

927 936 945 954 963 972

GTC ACT TGG CAG AAC AAG ATA ATA GAC TGG ATG ATC CAG AAC AAA TTC TGT GAC 



Val Thr Trp Gln Asn Lys Ile Ile Asp Trp Met Ile Gln Asn Lys Phe Cys Asp

981 990 999 1008 1017 1026 

TTC TTC TAC TGG AGC TGG AAC CCA AAC AGC GGT GAC ACC GGT GGA ATT CTG AAG 



Phe Phe Tyr Trp Ser Trp Asn Pro Asn Ser Gly Asp Thr Gly Gly Ile Leu Lys

1035 1044 1053 1062 1071 1080

GAT GAC TGG ACG ACA ATA TGG GAG GAC AAG TAC AAC AAC CTG AAG AGG CTC ATG 



Asp Asp Trp Thr Thr Ile Trp Glu Asp Lys Tyr Asn Asn Leu Lys Arg Leu Met



1089 1098 1107 1116 1125 1134

GAC AGC TGT TCT GGA AAC GCC ACT GCC CCG TCC GTC CCC ACG ACA ACT ACA ACA 



Asp Ser Cys Ser Gly Asn Ala Thr Ala Pro Ser Val Pro Thr Thr Thr Thr Thr

1143 1152 1161 1170 1179 1188

ACA AGC ACA CCG CCA ACG ACC ACA ACG ACT ACA ACA TCC ACT CCA ACG ACC ACT 



Thr Ser Thr Pro Pro Thr Thr Thr Thr Thr Thr Thr Ser Thr Pro Thr Thr Thr

1197 1206 1215 1224 1233 1242

ACC CAG ACC CCG ACC ACC ACT ACT CCA ACT ACG ACA ACC ACC ACG ACC ACA ACT 



Thr Gln Thr Pro Thr Thr Thr Thr Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr
1251 1260 1269 1278 1287 1296

CCT TCA AAT AAC GTC CCA TTT GAA ATT GTG AAC GTT CTC CCG ACT AGC TCC CAG



Pro Ser Asn Asn Val Pro Phe Glu Ile Val Asn Val Leu Pro Thr Ser Ser Gln

1305 1314 1323 1332 1341 1350

TAC GAG GGA ACC AGC GTG GAG GTT GTA TGT GAT GGA ACC CAG TGT GCC TCC AGC



Tyr Glu Gly Thr Ser Val Glu Val Val Cys Asp Gly Thr Gln Cys Ala Ser Ser

1359 1368 1377 1386 1395 1404

GTT TGG GGA GCT CCG AAC CTC TGG GGA GTC GTT AAA ATC GGA AAC GCC ACC ATG 



Val Trp Gly Ala Pro Asn Leu Trp Gly Val Val Lys Ile Gly Asn Ala Thr Met

1413 1422 1431 1440 1449 1458 

GAC CCC AAC GTT TGG GGC TGG GAG GAC GTT TAC AAG ACT GCA CCC CAG GAC ATT 



Asp Pro Asn Val Trp Gly Trp Glu Asp Val Tyr Lys Thr Ala Pro Gln Asp Ile
1467 1476 1485 1494 1503 1512



GCA ACC GGC AGC ACA AAG ATC GAG ATA AGG AAC GGG GTG CTC AAG GTT ACA AAC 



Gly Thr Gly Ser Thr Lys Met Glu Ile Arg Asn Gly Val Leu Lys Val Thr Asn

1521 1530 1539 1548 1557 1566

CTC TGG AAC ATC AAC ATG CAT CCG AAG TAT AAC ACA ATG GCA TAC CCG GAG GTC 

Leu Trp Asn Ile Asn Met His Pro Lys Tyr Asn Thr Met Ala Tyr Pro Glu Val

1575 1584 1593 1602 1611 1620

ATA TAC GGC GCC AAG CCT TGG GGC AAC CAG CCA ATA AAC GCT CCG AAC TTC GTG 



Ile Tyr Gly Ala Lys Pro Trp Gly Asn Gln Pro Ile Asn Ala Pro Asn Phe Val

1629 1638 1647 1656 1665 1674

CTC CCG ATA AAG GTC TCC CAG OCT CCG AGG ATA CTC GTT GAC ACA AAG TAC ACQ 



Leu Pro Ile Lys Val Ser Gln Leu Pro Arg Ile Leu Val Asp Thr Lys Tyr Thr
1683 1692 1701 1710 1719 1728 

CTC GAA AAG AGC TTC CCG GGA AAC AAC TTC GCC TTT GAG GCC TGG CTC TTC AAG 



Leu Glu Lys Ser Phe Pro Gly Asn Asn Phe Ala Phe Glu Ala Trp Leu Phe Lys 

1737 1746 1755 1764 1773 1782

GAT GCC AAC AAC ATG AGG GCA CCA GGC CAG GGG GAC TAC GAG ATA ATG GTA CAG 



Asp Ala Asn Asn Met Arg Ala Pro Gly Gln Gly Asp Tyr Glu Ile Met Val Gln

1791 1800 1809 1818 1827 1836 

CTC TAC ATC GAG GGC GGC TAT CCT GCG GGC TAC GAC AAG GGG CCA GTT CTC ACC 



Leu Tyr Ile Glu Gly Gly Tyr Pro Ala Gly Tyr Asp Lys Gly Pro Val Leu Thr

1845 1854 1863 1872 1881 1890

GTT GAT GTT CCG ATA ATC GTC GAT GGA AGG CTT GTA AAC CAG ACT TTT GAG CTC 



Val Asp Val Pro Ile Ile Val Asp Gly Arg Leu Val Asn Gln Thr Phe Glu Leu

1899 1908 1917 1926 1935 1944

TAC GAC GTC ATA GCG GAT GCC GGA TGG AGG TTC TTC ACC TTC AAG CCA ACT AAG 



Tyr Asp Val Ile Ala Asp Ala Gly Trp Arg Phe Phe Thr Phe Lys Pro Thr Lys

1953 1962 1971 1980 1989 1998

AAC TAC AAC GGC TCA GAG GTT GTG TTC GAC TAC ACC AAA TTC ATA GAA ATA GTT 



Asn Tyr Asn Gly Ser Glu Val Val Phe Asp Tyr Thr Lys Phe Ile Glu Ile Val

2007 2016 2025 2034 2043 2052

GAC AAC TAC CTC GGC GGT GGC AGC CTC ACG AAC CAC TAC CTG ATG TCC CTG GAA 



Asp Asn Tyr Leu Gly Gly Gly Ser Leu Thr Asn His Tyr Leu Met Ser Leu Glu

2061 2070 2079 2088 2097 2106

TTC GGT ACC GAG ATA TAC ACC AAC GGG TGC ACC TCA TTC CCA TGC ACA GTG GAC 



Phe Gly Thr Glu Ile Tyr Thr Asn Gly Cys Thr Ser Phe Pro Cys Thr Val Asp
2115 2124 2133 2142 2151 2160

GTA AGG TGG ACC CTT GAC AAG TAC AGG TTC ATC CTG GCC CCA GGA ACA ATG GCC 



Val Arg Trp Thr Leu Asp Lys Tyr Arg Phe Ile Leu Ala Pro Gly Thr Met Ala

2169 2178 2187 2196 2205 2214 

ACT GAG GAG GCC ATG AGA GTT CTC GTC GGA GAG GTC CAG CCT CCC GCT TCC ACA 



Thr Glu Glu Ala Met Arg Val Leu Val Gly Glu Val Gln Pro Pro Ala Ser Thr

2223 2232 2241 2250 2259 2268

ACA ACA TCG CAG ACG ACT ACT TCA ACC ACA ACC CCA ACG CCC ACT ACC ACT ACT 



Thr Thr Ser Gln Thr Thr Thr Ser Thr Thr Thr Pro Thr Pro Thr Thr Thr Thr

2277 2286 2295 2304 2313 2322

ACG ACT CAG ACT TCA ACC ACC ACT ACA ACC ACC TCA CCG CCG ACA ACC ACG GCA

Thr Thr Gln Thr Ser Thr Thr Thr Thr Thr Thr Ser Pro Pro Thr Thr Thr Ala

2331 2340 2349 2358 2367 2376

CCT GCT CAG GAC GTA ATT AAG CTC AGG TAC CCG GAC GAT GGG CAG TGG CCC GAG 

Pro Ala Gln Asp Val Ile Lys Leu Arg Tyr Pro Asp Asp Gly Gln Trp Pro Glu

2385 2394 2403 2412 2421 2430

GCC CCA ATT GAC AGG GAT GGA GAC GGA AAC CCA GAG TTC TAC ATA GAA ATA AAC 

Ala Pro Ile Asp Arg Asp Gly Asp Gly Asn Pro Glu Phe Tyr Ile Glu Ile Asn

2439 2448 2457 2466 2475 2484

CCG TGG AAC ATA CTG AGC GCT GAA AGC TAC GCC GAG ATG ACC TAC AAC TTG AGC 

Pro Trp Asn Ile Leu Ser Ala Glu Ser Tyr Ala Glu Met Thr Tyr Asn Leu Ser

2493 2502 2511 2520 2529

AGC GGG GTT CTC CAC TAC GTC CAG GCC CTG GAT AGT ATA TGA TGA 3' 

Ser Gly Val Leu His Tyr Val Gln Ala Leu Asp Ser Ile *** ***
ATG CCA ACC AAT GTA TTT TTC AAC GCC CAT CAC TCG CCG GTT GGG GCG TTT
Met Pro Thr Asn Val Phe Phe Asn Ala His His Ser Pro Val Gly Ala Phe

GCC AGC TTT ACG CTA GGG TTT CCG GGA AAA AGC GGA GGA CTG GAC TTG GAA
Ala Ser Phe Thr Leu Gly Phe Pro Gly Lys Ser Gly Gly Leu Asp Leu Glu

CTT GCC CGA CCG CCA CGG CAA AAT GTC TTT ATT GGC GTT GAG TCG CCG CAT 
Leu Ala Arg Pro Pro Arg Gln Asn Val Phe Ile Gly Val Glu Ser Pro His

GAG CCG GGG CTG TAT CAT ATC CTT CCA TTC GCG GAA ACA GCA GGC GAG GAT 
Glu Pro Gly Leu Tyr His Ile Leu Pro Phe Ala Glu Thr Ala Gly Glu Asp

GAA AGC AAA CGA TAT GAC ATT GAA AAT CCT GAT CCG AAT CCG CAA AAA CCA 
Glu Ser Lys Arg Tyr Asp Ile Glu Asn Pro Asp Pro Asn Pro Gln Lys Pro

AAC ATC CTG ATT CCA TTT GCG AAA GAS CGG ATC GAA CGC GAA TTT CGC GTT 
Asn Ile Leu Ile Pro Phe Ala Lys Glu Arg Ile Glu Arg Glu Phe Arg Val

GCC ACG GAT ACA TGG AAG GCC GGG GAC TTG ACG TTG ACG ATT TAT TCA CCG 
Ala Thr Asp Thr Trp Lys Ala Gly Asp Leu Thr Leu Thr Ile Tyr Ser Pro



GTG AAG GCC GTA CCA GAT CCG GAA 
Val Lys Ala Val Pro Asp Pro Glu 

GCG TTG GTT CCA GCT GTC ATT GTC 

Ala Leu Val Pro Ala Val Ile Val

ACA AGA ACA CGA CGC GCG TTT TTC
Thr Arg Thr Arg Arg Ala Phe Phe

TCG ATG CGG GGG ATC GAT GAT ACA 
Ser Met Arg Gly Ile Asp Asp Thr

GGG CGG ATT TTG GGC ATA GCA TCC 
Gly Arg Ile Leu Gly Ile Ala Ser

CAT TTT AGC ATG GAG GAT ATC TTA 
His Phe Ser Met Glu Asp Ile Leu

TTT GGG CTC GGG AAA GTC GGT GCA 
Phe Gly Leu Gly Lys Val Gly Ala 

AAG AAA ACG TAT CAA TTT GCT GTT 
Lys Lys Thr Tyr Gln Phe Ala Val



ACS GCC TCC GAG GAA GAA CTC AAG TTG 
Thr Ala Ser Glu Glu Glu Leu Lys Leu 

GAG ATG ACG ATC GAT AAT ACG AAC GGA 

Glu Met Thr Ile Asp Asn Thr Asn Gly

GGA TTC GAA GGC ACT GAC CCG TAT ACC 

Gly Phe Glu Gly Thr Asp Pro Tyr Thr 

TGC CCG CAG CTG CGC GGT GTC GGT CAA 
Cys Pro Gln Leu Arg Gly Val Gly Gln

AAG GAT GAG GGC GTT CGT TCA GCA CTG 
Lys Asp Glu Gly Val Arg Ser Ala Leu

ACG GCG ACT CTC GAA GAA AAC TGG ACG 
Thr Ala Thr Leu Glu Glu Asn Trp Thr

TTA ATT GCG GAT GTG CCG GCG GGA GAA 
Leu Ile Ala Asp Val Pro Ala Gly Glu

TGC TTC TAT CGT GGG GGT TGT GTG ACG 
Cys Phe Tyr Arg Gly Gly Cys Val Thr 



GCG GGA ATG GAT GCC TCT TAT TTT TAC ACC CGT TTC TTC CAT AAT ATC CAA 
Ala Gly Met Asp Ala Ser Tyr Phe Tyr Thr Arg Phe Phe His Asn Ile Glu

GAA GTC GGT CTT TAT GCG TTA GAG CAG GCC GAG GTG TTA AAA GAG CAG GCG 
Glu Val Gly Leu Tyr Ala Leu Glu Gln Ala Glu Val Leu Lys Glu Gln Ala

TTC CGT TCG AAT GAA CTC ATT GAA AAA GAA TGG CTC TCC GAT GAT CAA AAG 
Phe Arg Ser Asn Glu Leu Ile Glu Lys Glu Trp Leu Ser Asp Asp Gln Lys

TTT ATG ATG GCG CAC GCG ATC CGT AGC TAC TAT GGC AAT ACA CAG CTG CTT 
Phe Met Met Ala His Ala Ile Arg Ser Tyr Tyr Gly Asn Thr Gln Leu Leu

GAG CAT GAA GGA AAG CCG ATT TGG GTC GTC AAT GAA GGC GAG TAC CGG ATG
Glu His Glu Gly Lys Pro Ile Trp Val Val Asn Glu Gly Glu Tyr Arg Met

ATG AAT ACG TTT GAT CTC ACC GTC GAC CAG CTC TTT T™ GAA TTG AAA ATG 
Met Asn Thr Phe Asp Leu Thr Val Asp Gln Leu Phe Phe Glu Leu Lys Met

AAT CCG TGG ACG GTG AAA AAT GTG CTT GAC TTT TAT GTC GAG CGC TAC AGC 
Asn Pro Trp Thr Val Lys Asn Val Leu Asp Phe Tyr Val Glu Arg Tyr Ser 

TAT GAG GAT CGT GTC CGT TTC CCA GGA GAT GAG ACG GAA TAC CCC GGC GGC 



Tyr Glu Asp Arg Val Arg Phe Pro Gly Asp Glu Thr Glu Tyr Pro Gly Gly

ATC AGC TTC ACT CAC GAT ATG GGA GTC GCC AAC ACG TTC TCA CGC CCG CAT
Ile Ser Phe Thr His Asp Met Gly Val Ala Asn Thr Phe Ser Arg Pro His

TAC TCG TCA TAT GAG CTA TAC GGG ATC AGC GGC TGC TTT TCA CAT ATG ACG
Tyr Ser Ser Tyr Glu Leu Tyr Gly Ile Ser Gly Cys Phe Ser His Met Thr

CAC GAA CAG CTC GTC AAC TGG GTG CTT TGC GCA GCG GTA TAC ATC GAA CAA 
His Glu Gln Leu Val Asn Trp Val Leu Cys Ala Ala Val Tyr Ile Glu Gln

ACG AAA GAC TGG GCA TGG CGC GAC CGG CGG CTT ACG ATC TTG GAA CAA TGT 
Thr Lys Asp Trp Ala Trp Arg Asp Arg Arg Leu Thr Ile Leu Glu Gln Cys

CTC GAA AGC ATG GTG CGC CGC GAT CAT CCG GAT CCA GAA AAG CGG AAC GGC 

Leu Glu Ser Met Val Arg Arg Asp His Pro Asp Pro Glu Lys Arg Asn Gly

GTG ATG GGG CTT GAC AGC ACC CGC ACG ATG GGT GGA GCG GAA ATC ACA ACG 

Val Met Gly Leu Asp Ser Thr Arg Thr Met Gly Gly Ala Glu Ile Thr Thr

TAT GAT AGT TTG GAT GTT TCT CTT GGC CAG GCG CGC AAC AAT TTA TAT TTG 
Tyr Asp Ser Leu Asp Val Ser Leu Gly Gln Ala Arg Asn Asn Leu Tyr Leu



GCA GGA AAA TGT TGG GCT GCC TAT GTG GCG CTC GAA AAG TTG TTC CGC GAT 
Ala Gly Lys Cys Trp Ala Ala Tyr Val Ala Leu Glu Lys Leu Phe Arg Asp

GTC GGC AAA GAA GAA CTG GCT GCA TTG GCA AGG GAG CAG GCG GAA AAA TGC 
Val Gly Lys Glu Glu Leu Ala Ala Leu Ala Arg Glu Gln Ala Glu Lys Cys

GCC GCG ACG ATT GTC AGT CAC GTG ACG GAG GAC GGG TAT ATC CCA GCC GTG
Ala Ala Thr Ile Val Ser His Val Thr Glu Asp Gly Tyr Ile Pro Ala Val

ATG GGA GAA GGA AAX GAC TCG AAA ATC ATT CCG GCT ATT GAG GGG CTT GTG 
Met Gly Glu Gly Asn Asp Ser Lys Ile Ile Pro Ala Ile Glu Gly Leu Val

TTT CCT TAC TTT ACG AAC TGC CAT GAG GCG TTA AGA GAA GAC GGA CGT TTT 
Phe Pro Tyr Phe Thr Asn Cys His Glu Ala Leu Arg Glu Asp Gly Arg Phe

GGA GAC TAT ATT CGT GCA CTG CGA CAA CAT TTG CAA TAT GTG TTG CGG GAA 
Gly Asp Tyr Ile Arg Ala Leu Arg Gln His Leu Gln Tyr Val Leu Arg Glu

GGA ATT TAC CTA TTC CCG GAC GGG GGA TGG AAA ATT TGC CTC GAC AAG CAA
Gly Ile Tyr Leu Phe Pro Asp Gly Gly Trp Lys Ile Cys Leu Asp Lys Gln

CAA CTC GTG GTT GAG CAA AAT TTA CTT ATG CCA GTT TAT TGC CCG CCG CAT 
Gln Leu Val Val Glu Gln Asn Leu Leu Met Pro Val Tyr Cys Pro Pro His

TTT AGG GTG GGA ATG GGA TGA 1953 

Phe Arg Val Gly Met Gly END
ATG TTG AAA AAA CTG GCT TTA GCA GCC GGG ATC GCA GCA GCA ACA CTG GCT
Met Leu Lys Lys Leu Ala Leu Ala Ala Gly Ile Ala Ala Ala Thr Leu Ala

GCA TCC GGT TCC CAT GGG CAG ACG TTC GCG TAC GGC GAA GCT CTG CAA AAA
Ala Ser Gly Ser His Gly Gln Thr Phe Ala Tyr Gly Glu Ala Leu Gln Lys

TCC ATC TAT TTT TAT GAG GCT CAA CAG GCC GGC CCA CTC CCG GAA TGG AAC 
Ser Ile Tyr Phe Tyr Glu Ala Gln Gln Ala Gly Pro Leu Pro Glu Trp Asn

CGC GTT GCC TGG CGT GGC GAC TCA GTT CCT GAT GAC GGT GCC GAC GTC GGA 
Arg Val Ala Trp Arg Gly Asp Ser Val Pro Asp Asp Gly Ala Asp Val Gly

CTG GAT TTA CGC GGT GGC TGG TTC GAT GCG GGC GAC CAC GTT AAG TTT GGC 
Leu Asp Leu Arg Gly Gly Trp Phe Asp Ala Gly Asp His Val Lys Phe Gly

TTT CCA ATG GCC GCG TCA GCG ACA CTC GTC GCC TGG GGA GGC GTC GAT TAC
Phe Pro Met Ala Ala Ser Ala Thr Leu Val Ala Trp Gly Gly Val Asp Tyr

AAA GAC GCG TAC CAA CAG TCG GGG CAA ATC GAA CAT CTG CGC AAC AAC CTG 
Lys Asp Ala Tyr Glu Gln Ser Gly Gln Met Glu His Leu Arg Asn Asn Leu



CCC TTC GTC AAT GAC TAC TTT ATC 
Arg Phe Val Asn Asp Tyr Phe Ile

TAC GGG CAG GTT GGC GAT GGC AGT 

Tyr Gly Gln Val Gly Asp Gly Ser

GAG GTT CTG CAC CAC AAG ATC CCC 
Glu Val Leu His His Lys Ile Pro

GAA AGC TGC CCG GGT ACC GAT CTG 
Glu Ser Cys Pro Gly Thr Asp Leu

GCG TCT GCG ATG GTT TTT CAG GGT 
Ala Ser Ala Met Val Phe Gln Gly

ATC ACT CAC GCC AAA CAG CTG TGG 
Ile Thr His Ala Lys Gln Leu Trp

ACC GGT ACA GAT ACA GCC TAT TCC 
Thr Gly Thr Asp Thr Ala Tyr Ser 

TAT ACG TCG ACG TAT GGC GTT TAC 
Tyr Thr Ser Thr Tyr Gly Val Tyr 



AGC GCG CAC CCC GCT CCG AAC GTG CTT 
Ser Ala His Pro Ala Pro Asn Val Leu 

GCA GAC CAT ACC TTC TGG GGT CCC GCT 
Ala Asp His Thr Phe Trp Gly Pro Ala

GGC TCG CGC ATT TCT ATG AAG ATT GAC 
Gly Ser Arg Ile Ser Met Lys Ile Asp

GCC GCA GAG ACC GCA GCA GCG ATG GCC 
Ala Ala Glu Thr Ala Ala Ala Met Ala 

GAG GAC GAT GCT TAC GCA GCA ACC CTG 
Glu Asp Asp Ala Tyr Ala Ala Thr Leu 

CAA TTT GCT GAT TCA ACC AAA GGC ACA
Gln Phe Ala Asp Ser Thr Lys Gly Thr

AAT TGC ATA ACA GGT GCA CAG GGC TTT 

Asn Cys Ile Thr Gly Ala Gln Gly Phe

TAC GAT GAA CTT GCC TGG GGT GCT CTC 
Tyr Asp Glu Leu Ala Trp Gly Ala Leu 



TGG TTA TGG CGC GCA ACT GGA GAA GAC TTC TAC CTG GAA CAA GCC AAG CAT
Trp Leu Trp Arg Ala Thr Gly Glu Asp Phe Tyr Leu Glu Gln Ala Lys His

TAC TAC GGT TTG ATG GGC TTT GAA AAC CAG ACG ACA ACT CCG GTA TAT ACC 
Tyr Tyr Gly Leu Met Gly Phe Glu Asn Gln Thr Thr Thr Pro Val Tyr Thr

TGG TCG CTT GGC TGG AAC GAT AAA GCG TAT GCC GTT TAT GTA CTT ATG GCC
Trp Ser Leu Gly Trp Asn Asp Lys Ala Tyr Ala Val Tyr Val Leu Met Ala

GCA CTT GTA GGT GAC GAG GTT TAC CAC GCA GAT GCA CAG CGC TAC CTC GAT
Ala Leu Val Gly Asp Glu Val Tyr His Ala Asp Ala Gln Arg Tyr Leu Asp

CAC TGG AGC GTC GGC GAG GGT AAC CGC ACA CGC AAT GGG CTG ATT CTG GTC 
His Trp Ser Val Gly Glu Gly Asn Arg Thr Pro Asn Gly Leu Ile Leu Val

GAC TCC TGG GGG GTA AAC CGC TAT GCG GCC AAC GCG GGT TAT CTC GCA CTC 
Asp Ser Trp Gly Val Asn Arg Tyr Ala Ala Asn Ala Gly Tyr Leu Ala Leu

TTT TAT GCA GAT GCG ATT GGC ACT GAC CAC CCC CTT TAT GAT CGT TAC CAC 
Phe Tyr Ala Asp Ala Ile Gly Ser Asp His Pro Leu Tyr Asp Arg Tyr His

AAT TTT GGT AAG AAG CAG ATC GAT CAT ATC CTG GGC GAC AAC CCT GAC AAC 



Asn Phe Gly Lys Lys Gln Ile Asp His Ile Leu Gly Asp Asn Pro Asp Asn

CAA AGC TAC GTC GTC GGC TTT GGC GAT AAT TTC CCA ATC AAT GTT CAC CAC 
Gln Ser Tyr Val Val Gly Phe Gly Asp Asn Phe Pro Ile Asn Val His His

CGT GGC TCC CAC GGT TCC TGG TCC GAC AGC ATT TCC AAC CCG GTT AAT CAA 

Arg Gly Ser His Gly Ser Trp Ser Asp Ser Ile Ser Asn Pro Val Asn Gln

CGC CAT GTG CTA TAC GGC GCA GTT GCC GGT GGT CCG CAG GGC GAT ACA GGC 
Arg His Val Leu Tyr Gly Ala Val Ala Gly Gly Pro Gln Gly Asp Thr Gly

TAT GAA GAA GAC CGC AAT GAC TAT GTG CAG AAT GAG GTC GCA ACA GAC TAC 
Tyr Glu Glu Asp Arg Asn Asp Tyr Val Gln Asn Glu Val Ala Thr Asp Tyr

AAC TCA GGC TTC ACC AGT GCC GTC GCT GCA CTT TAT GAT CAC TAT GGT GGC 
Asn Ser Gly Phe Thr Ser Ala Val Ala Ala Leu Tyr Asp His Tyr Gly Gly

GCG CCC CTG GCG AAC TTC CCG CCT CCC GAA CCA GAG TCG GTG GAG TAT CTC
Ala Pro Leu Ala Asn Phe Pro Pro Pro Glu Pro Glu Ser Val Glu Tyr Leu 

GTG GGG GCC AAG ATC AAT TCC TCT GGC AAC CGC TTC GTG GAA ATG AAA GCC 

Val Gly Ala Lys Ile Asn Ser Ser Gly Asn Arg Phe Val Glu Met Lys Ala



GTT ATT CAA AAC CAC AGC ACA ACA CCC CCC CAA GGT AAA GAC GAC CTT TAC 
Val Ile Gln Asn His Ser Thr Thr Pro Ala Gln Gly Lys Asp Asp Leu Tyr

ATG CGC TAT TTC TAT GAT CTG AGC GAA GTA TTT GCC GCA GGC TAC AGT TTG 

Met Arg Tyr Phe Tyr Asp Leu Ser Glu Val Phe Ala Ala Gly Tyr Ser Leu

AAT GAT CTA ACG GTG GCG TCC GGA TAC AAC CAA GCC TCG GAT GTG AAT GGC 

Asn Asp Leu Thr Val Ala Ser Gly Tyr Asn Gln Ala Ser Asp Val Asn Gly

CTG CAA CAT TGG GAT GGC AAC GTC TAC TAT GTG GAA GCC CAG TTC TAT GAC
Leu Gln His Trp Asp Gly Asn Val Tyr Tyr Val Glu Ala Gln Phe Tyr Asp

GAT GTG GTA TTT CCC GGT GGT CAG TCC GCC CAC CGA CGG GAA GTA CAA TTT
Asp Val Val Phe Pro Gly Gly Gln Ser Ala His Arg Arg Glu Val Gln Phe

CGC GTG TCC CTG CCA ACC ACA TCC AAT CTT GCC GAG TGG GAC AAC ACG AAC 
Arg Val Ser Leu Pro Thr Thr Ser Asn Leu Ala Glu Trp Asp Asn Thr Asn

GAC CCC TCG TTT GAT CCA AGT TAT TTA ACG GTC GAT AGT AGT CTG ACT TAC 

Asp Pro Ser Phe Asp Pro Ser Tyr Leu Thr Val Asp Ser Ser Leu Thr Tyr 

GGT ATC GAC GCG CCG AAA ATT CCA CTC TAC GAC GCC AAC GGC CTG CTC TGG 
Gly Ile Asp Ala Pro Lys Ile Pro Leu Tyr Asp Ala Asn Gly Leu Leu Trp



GGC GAG GAG CCA CCC CGT GGC GGA ACT TCC TCC AGC TCA TCG TCG AGC AGT

Gly Glu Glu Pro Pro Arg Gly Gly Thr Ser Ser Ser Ser Ser Ser Ser Ser

TCG TCC TCT AGC TCA TCC AGC AGT TCA TCG TCG AGC AGC TCC TCG AGC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 

TCG TCC TCG AGT AAT TCG TCC TCT AGC TCG TCC AGC TCT TCG TCG AAT TCT 
Ser Ser Ser Ser Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Ser

TCG TCG TCT AAC AGC AGT TCC TCG TCC AGC TCA AGC TCA TCG AGC AGT TCC 
Ser Ser Ser Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser

AGT TCG TCG AGT TCG GGC GGC ACC TGT GCG GAC GTG AAC GTA TAC CCC AAC
Ser Ser Ser Ser Ser Gly Gly Thr Cys Ala Asp Val Asn Val Tyr Pro Asn

TGG ACC GCA CGT GAC TGG GCC GGT GGA GTA CCG AAC CAC GCG GAA GCC GGT
Trp Thr Ala Arg Asp Trp Ala Gly Gly Val Pro Asn His Ala Glu Ala Gly

GAT TTG ATG GTT TAC CAA GGT ACT GTC TAC CAA GCT AAT TGG TAC ACC AAC 
Asp Leu Met Val Tyr Gln Gly Thr Val Tyr Gln Ala Asn Trp Tyr Thr Asn



AGT GTG CCT GGC AGT GAT GCA TCC TGG ACC AAC CAA GGG TTA TGT GCC GGC 



Ser Val Pro Gly Ser Asp Ala Ser Trp Thr Asn Gln Gly Leu Cys Ala Gly

GGC GGA TCC AGC TCC AGC AGC TCA TCA TCC AGC TCA AGC AGC TCT TC~- TCC 
Gly Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 

AGC AGC AGC TCA AGC TCG TCC AGT GGT GCG TCC GGT TCA TCC TCC AGC TCG 
Ser Ser Ser Ser Ser Ser Ser Ser Gly Ala Ser Gly Ser Ser Ser Ser Ser

AGC AGT TCG TCC TCG TCA AGT TCG AGC AGC AGC TCT TCG AGT TCG TCT TCT 
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser

GGT GGC GGC GCC ATG TGT AAC TGG TAT GGC TGG CAA GTA CCT ATT TGT GAA
Gly Gly Gly Ala Met Cys Asn Trp Tyr Gly Trp Gln Val Pro Ile Cys Glu

AAC ACC CCA TCT GGC TGG GGC AAC GAA AAT GGC CAA ACA TGT GTC GGC CCC 
Asn Thr Pro Ser Gly Trp Gly Asn Glu Asn Gly Gln Thr Cys Val Gly Pro



GAT ACT TGC CAA GAG GTC GTC AAC TAA 
Asp Thr Cys Gln Glu Val Val Asn END
ATG AAC ATG ACC TAC ATG CAT CCG GCT GAA GAT ACT TAC TCC TTT GGT CAA
Met Lys Met Thr Tyr Met His Pro Ala Glu Asp Thr Tyr Ser Phe Gly Gln

GCG GAT CAG TTG GTfi AAC TGG GCG AAA GCG AAT GGT ATT C-GC C-TG CAC GGC 
Ala Asp Gln Leu Val Asn Trp Ala Lys Ala Asn Gly Ile Gly Val His Gly

CAC ACT CTG GTT TGG CAC TCC GAA TAC CAG GTA CCC AAT TGG ATG AAA AAT
His Thr Leu Val Trp His Ser Glu Tyr Gln Val Pro Asn Trp Met Lys Asn

TAC TCT GGT GAT GCA ACT GCA TTC CAA ACC ATG CTC AAC ACC CAT GTG AAA 
Tyr Ser Gly Asp Ala Thr Ala Phe Gln Thr Met Leu Asn Thr His Val Lys

ACT GTG GCT GAG CAT TTT GCT GGC GAA CTG SAC ACC TS3 CAC 'GTT CTC AAT 
Thr Val Ala Glu His Phe Ala Gly Glu Leu Asp Ser Trp Asp Val Val Asn

GAA GTG CTG GAG CCG GGC TCC AAT GGT TGC TGG CGT GAA AAC TCT CTG TTC 
Glu Val Leu Glu Pro Gly Ser Asn Gly Cys Trp Arg Glu Asn Ser Leu Phe

TAC CAG AAG CTT GGC AAA GAC TTT GTC GCG AAC GCA TTC CGT GCA GCT CGC 
Tyr Gln Lys Leu Gly Lys Asp Phe Val Ala Asn Ala Phe Arg Ala Ala Arg



GAG GGC GAT CCC AAT GCA GAC TTC- 
Glu Gly Asp Pro Asn Ala Asp Leu 

GGT GTA ACT TCC GAT GAG AAG TTC 
Gly Val Thr Ser Asp Glu Lys Phe

CTT CTG GAA GCG GAC GTG CCG ATT
Leu Leu Glu Ala Asp Val Pro Ile



TAT TAC AAC GAT TAC TCG ACT GAA AAT
Tyr Tyr Asn Asp Tyr Ser Thr Glu Asn 

AGT TGT TTG TTG GAA CTA GTC GAT GAG

Ser Cys Leu Leu Glu Leu Val Asp Glu

ACA GGT GTT GGT TTC CAA ATG CAC GTG 
Thr Gly Val Gly Phe Gln Met His Val



CAG GCG ACG TGG CCT AGC AAT GCC 
Gin Ala Thr Trp Pro Ser Asn Ala 

GCG GAT CCC GGT CTG AAA GTT AAA 
Ala Asp Arg Gly Leu Lys Val Lys 

AAC CCT TAC GGA ACC ACT AAT TTC 
Asn Pro Tyr Gly Thr Thr Asn Phe 

GCC GCC GAG CTG CAG AAG CAG CGC 
Ala Ala Glu Leu Gln Lys Gln Arg

GAT AAC GTA CCG GCC AAC CTG CGT 
Asp Asn Val Pro Ala Asn Leu Arg 



AAC ATC GGC AAG GCA TTC AAA GCC ATC 
Asn Ile Gly Lys Ala Phe Lys Ala Ile

ATT TCT GAG CTC GAT GTT CCT GTT AAC 
Ile Ser Glu Leu Asp Val Pro Val Asn

CCC CAA TAC AGC AGT TTT ACC GCG GAA 
Pro Gln Tyr Ser Ser Phe Thr Ala Glu

TAC AAG GGC ATT ATG CAA GCG TAC CTT
Tyr Lys Gly Ile Met Gln Ala Tyr Leu

GGT GGT TTC ACC GTG TGG GGC GTT TGG 
Gly Gly Phe Thr Val Trp Gly Val Trp 



GAT GGC GAT AGC TGG ATC ATG ACG TTC AGC CAG TAC ACC AAC GCT AAC GCC 

Asp Gly Asp Ser Trp Ile Met Thr Phe Ser Gln Tyr Thr Asn Ala Asn Ala

AAC GAC TGG CCA CTG TTG TTC ACC GGG CCG TAA B48 

Asn Asp Trp Pro Leu Leu Phe Thr Gly Pro END
ATG GGA ACA TCT CTT ATG ATC AAA TCT ACA CTG ACA GCT ATG ATT ACT CCT

Met Gly Thr Ser Leu Met Ile Lys Ser Thr Leu Thr Gly Met Ile Thr Ala

GTT GCC GCC GCA GTT TTC ACC ACC TCT GCA CCT TTC GCG GAT GTA CCT CCG
Val Ala Ala Ala Val Phe Thr Thr Ser Ala Ala Phe Ala Asp Val Pro Pro

TTG ACA GTG AGC GGA AAT CAG GTT TTA ACT GGC GGT GAA GCA AAA AGC TTC 
Leu Thr Val Ser Gly Asn Gln Val Leu Ser Gly Gly Glu Ala Lys Ser Phe

GCT GGT AAC AGC TTC TTT TGG AGC AAT ACC GGA TGG GGC CAG GAA CGT TTT 
Ala Gly Asn Ser Phe Phe Trp Ser Asn Thr Gly Trp Gly Gln Glu Arg Phe

TAC AAC GCA GAA ACT GTG CGT TGG TTG AAA GAC GAC TGG AAC GCA ACC ATT 
Tyr Asn Ala Glu Thr Val Arg Trp Leu Lys Asp Asp Trp Asn Ala Thr Ile

GTC CGC GCC GCT ATG GGC GTA GAC TTT GAT GGC AGC TAT ATC CCC GAG CAT 
Val Arg Ala Ala Met Gly Val Asp Phe Asp Gly Ser Tyr Ile Pro Glu His

GAA GAC GCC GAC CCC GAG GGT AAC GTC GCT CGC GTA CGT GCA TTG GTG GAT 
Glu Asp Ala Asp Pro Glu Gly Asn Val Ala Arg Val Arg Ala Leu Val Asp



GCA GCC ATC GCA GAA GAC ATG TAC 
Ala Ala Ile Ala Glu Asp Met Tyr

GCA GAA GAT TAC CAA GCC GAA TCT 

Ala Glu Asp Tyr Gln Ala Glu Ser

CTG TAC GGT GGG TAC GAC AAT GTT

Leu Tyr Gly Gly Tyr Asp Asn Val

CAA ATC AGC TGG GAC AAT GTT ATT 
Gln Ile Ser Trp Asp Asn Val Ile

GCT ATC CGC GCA ATC GAC CCG GAC 
Ala Ile Arg Ala Ile Asp Pro Asp

TGG TCA CAG GAC GTG GAC GCC GCT 
Trp Ser Gln Asp Val Asp Ala Ala

AAT ATT GCG TAC ACC CTG CAC TTT 

Asn Ile Ala Tyr Thr Leu His Phe

CGC GAT AAA GCG CGT AAC GCT ATG 
Arg Asp Lys Ala Arg Asn Ala Met 



GTG ATT ATC GAT TTT CAC ACT CAC CAC 
Val Ile Ile Asp Phe His Thr His His

ATC GAG TTC TTC GAA GAA ATG GCC ACA 
Ile Glu Phe Phe Glu Glu Met Ala Thr

ATT TAT GAA ATC TAT AAC GAG CCC CTG 

Ile Tyr Glu Ile Tyr Asn Glu Pro Leu

AAA CCT TAT GCA GAA TCG GTG ATT GGC
Lys Pro Tyr Ala Glu Ser Val Ile Gly

AAC CTG ATT ATC GTC GGC ACG CCC ACT
Asn Leu Ile Ile Val Gly Thr Pro Thr

GCG CGC AAT CCA ATC ACC AGC TAC AGC 
Ala Arg Asn Pro Ile Thr Ser Tyr Ser

TAC GCA GGC ACT CAC GGT TCA TGG TTG 

Tyr Ala Gly Thr His Gly Ser Trp Leu

AAC AGT GGT ATT GCG CTG TTT GTG ACT 
Asn Ser Gly Ile Ala Leu Phe Val Thr



GAG TGG GGC ACC GTT AAT GCA GAT GGC GAT GGT GCG CCT GCA GTT AAC GAA 
Glu Trp Gly Thr Val Asn Ala Asp Gly Asp Gly Ala Pro Ala Val Asn Glu

ACT CAG CAA TGG ATG GAC TTC CTC AAG CAG AAC AAT ATC TCT CAC TTG AAC 

Thr Gln Gln Trp Met Asp Phe Leu Lys Gln Asn Asn Ile Ser His Leu Asn

TGG TCC GTG AGT GAT AAA TTG GAA GGT GCG TCT ATC GTA CAA CCT GGC ACG 
Trp Ser Val Ser Asp Lys Leu Glu Gly Ala Ser Ile Val Gln Pro Gly Thr

CCC ATT AGC GGC TGG AAC GCT TCT GAC CTT ACG GCC TCC GGC ACA CTG GTT
Pro Ile Ser Gly Trp Asn Ala Ser Asp Leu Thr Ala Ser Gly Thr Leu Val

AAG AAC ATC GTT TCC AAC TGG GGC ACC ACA ATC GGT AAC GGC AGC TCC TCA 
Lys Asn Ile Val Ser Asn Trp Gly Thr Thr Ile Gly Asn Gly Ser Ser Ser

AGT TCA TCC AGC TCC TCT TCC AGC TCT TCA AGC AGT TCT TCT TCG AGC AGT
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser

TCC TCC TCC AGC AGC TCT TCC TCG TCA AGC AGC TCC GGA TCA ACT GGT GGC 
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Ser Thr Gly Gly

GGC AAC TGT GCT GGA GTG AAT GTG TAC CCG AAC TGG ACC GCG CGT GAC TGG 



Gly Asn Cys Ala Gly Val Asn Val Tyr
TCP GGC GGC GCC TAC AAG CAT GCG AAC 

Ser Gly Gly Ala Tyr Asn His Ala Asn

AAC AGC CTG TAT CGT GCC AAC TGG TAC 
Asn Ser Leu Tyr Arg Ala Asn Trp Tyr 

GCC TCC TGG ACT AGC CTT GGC GCC TGC 
Ala Ser Trp Thr Ser Leu Gly Ala Cys

TCC AGC TCA AGC AGC TCC TCG TCA AGC 
Ser Ser Ser Ser Ser Ser Ser Ser Ser

TCG TCT ACT GGC GGT GGC TCC AGC TCC
Ser Ser Thr Gly Gly Gly Ser Ser Ser

TCG TCT TCC AGC AGC TCT AGC AGC ACT 

Ser Ser Ser Ser Ser Ser Ser Ser Thr 



Pro Asn Trp Thr Ala Arg Asp Trp 

CCT GGC GAC CAA ATG GTC TAT CAA 
Ala Gly Asp Gln Met Val Tyr Gln

ACC AAC AGC GTG CCT GGC AGC GAC 
Thr Asn Ser Val Pro Gly Ser Asp

GGA GGC AAC GGA AGT ACG ACC TCA 
Gly Gly Asn Gly Ser Thr Thr Ser

AGC AGC TCT TCT TCC AGC AGC TCC 
Ser Ser Ser Ser Ser Ser Ser Ser

TCC AGC AGT TCA TCT TCT TCA TCG

Ser Ser Ser Ser Ser Ser Ser Ser

GGT GGC GGT CAA TGT ACC GAA GTG 

Gly Gly Gly Gln Cys Thr Glu Val



TGC AAC TGG TAC GGT CAG GGA ACC TAC CCA CTG TGT AAC AAC ACC AGT GGT 
Cys Asn Trp Tyr Gly Gln Gly Thr Tyr Pro Leu Cys Asn Asn Thr Ser Gly



TGG GGT TGG GAA AAC AAT CAG AGC TGT ATC GGC CGT CAA ACC TGT GAG TCA
Trp Gly Trp Glu Asn Asn Gln Ser Cys Ile Gly Arg Gln Thr Cys Glu Ser

CAG AAC GGT GGC GCT GGC GGC GTG GTG AGC AAC TGC ACC GGT TCG ACT ACA 
Gln Asn Gly Gly Ala Gly Gly Val Val Ser Asn Cys Thr Gly Ser Ser Thr

TCC AGC AGC TCC TCT TCC AGC AGT AGT TCT TCC TCA AGT AGC AGC TCC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 

TCA TCC AGC AGC TCT TCA TCT GGC ACT GGT AGC AGT ACA TCT TCC AGC AGC 
Ser Ser Ser Ser Ser Ser Ser Gly Thr Gly Ser Ser Thr Ser Ser Ser Ser

AGC TCT TCC AGC AGC TCC AGC TCA AGT ACC GGT TCC TCC GGT ATG CCT GGA 
Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Gly Ser Ser Gly Met Pro Gly

CCA CGC GTG GAC AAC CCC TTC GCC GCT GCG CAG AAG TGG TAC ATA AAC CCA
Pro Arg Val Asp Asn Pro Phe Ala Ala Ala Gln Lys Trp Tyr Ile Asn Pro

ATG TGG TCA GCG AGT GCT GCA AAC GAA CCC GGC GGC TCT GTC ATT GCC AAC 
Met Trp Ser Ala Ser Ala Ala Asn Glu Pro Gly Gly Ser Val Ile Ala Asn

GAA CCC TCG TTT GTA TGG ATG GAC CGT ATC GGC GCA ATC GAA GGG CCT GCT 
Glu Pro Ser Phe Val Trp Met Asp Arg Ile Gly Ala Ile Glu Gly Pro Ala



GAC GGT ATG GGC CTC CGC GAC CAC TTG 
Asp Gly Met Gly Leu Arg Asp His Leu

GAC CTG TTC ATG TTT GTT GTG TAC GAC 

Asp Leu Phe Met Phe Val Val Tyr Asp

CTC GCC TCC AAC GGT GAA CTG CGC ATC 
Leu Ala Ser Asn Gly Glu Leu Arg Ile

AAG TCC GAC TAC ATC GCA CCT ATC GTT 
Lys Ser Asp Tyr Ile Ala Pro Ile Val

GCA GGT ATC AAA ATC GCT GCG GTT ATC 
Ala Gly Ile Lys Ile Ala Ala Val Ile

GTT ACC AAT CTG AGC GAA CCT GAC TGT 
Val Thr Asn Leu Ser Glu Pro Asp Cys

TAC CGC GAC GGC ATT CGT CAC GCT ATC 
Tyr Arg Asp Gly Ile Arg His Ala Ile

GTA TAC TCC TAC GTG GAT ATT GCA CAC 



AAC GAA GCC CTT GCA CAA GGC GC~ 
Asn Glu Ala Leu Ala Gln Gly Ala

CTG CCA AAC CGT GAC TGT GCT GCA 

Leu Pro Asn Arg Asp Cys Ala Ala

TCC GAA GAT GGC TTC AAC ATC TAC 
Ser Glu Asp Gly Phe Asn Ile Tyr

GAA ATC ATC AGC GAC CCT GCA TAC 
Glu Ile Ile Ser Asp Pro Ala Tyr

GAG GTG GAC TCA CTG CCT AAC CTG

Glu Val Asp Ser Leu Pro Asn Leu

CAG GAA GCA AAT GGT CCT GGC GGC 
Gln Glu Ala Asn Gly Pro Gly Gly

ACT GAA CTG GGC AAA ATC CCC AAC 
Thr Glu Leu Gly Lys Ile Pro Asn

TCA GGC TGG CTG GGC TGG AAC GAC 



Val Tyr Ser Tyr Val Asp Ile Ala His

AAC TTC GCG CAA GGC GTT AAC CTG ATT 
Asn Phe Ala Gln Gly Val Asn Leu Ile

TCC GGC ATT AAC CCA ATC GCC GGT TTC 
Ser Gly Ile Asn Pro Ile Ala Gly Phe

CCT GTG GAA GAA CCC TTC TTG CCA GAC 
Pro Val Glu Glu Pro Phe Leu Pro Asp 

CCC GTT CGC TCT TCC GAT TTC TAT GAG 
Pro Val Arg Ser Ser Asp Phe Tyr Glu

CCC TTC GTG ACC GAT TGG CGT TCT GCC 

Pro Phe Val Thr Asp Trp Arg Ser Ala

TCC ATC GGT ATG CTG ATC GAT ACC GCA 
Ser Ile Gly Met Leu Ile Asp Thr Ala

CGT CCA ACT GCG CAG TCT ACC TCC AAC 
Arg Pro Thr Ala Gln Ser Thr Ser Asn



Ser Gly Trp Leu Gly Trp Asn Asp

TAT GAA GTG GTT GCC AAC CTC GGT 
Tyr Glu Val Val Ala Asn Leu Gly 

GTC AGT AAC TCC GCT AAC TAC ACG 
Val Ser Asn Ser Ala Asn Tyr Thr 

GCC AAC CTG CAG GTC GGT GGT CAG
Ala Asn Leu Gln Val Gly Gly Gln

TGG AAC AGC TAC CTG GCA GAG AAA 
Trp Asn Ser Tyr Leu Ala Glu Lys 

ATG ATC TCG AAA GGT ATG CCA AGC 

Met Ile Ser Lys Gly Met Pro Ser

CGT AAC GGC TGG GGT GGC CCT GAG 
Arg Asn Gly Trp Gly Gly Pro Glu

AAC CTG AAC ACC TTC GTT AAC GAA 

Asn Leu Asn Thr Phe Val Asn Glu



TCA CGT ATC GAC CGT CCT GAG CAC 
Ser Arg Ile Asp Arg Arg Glu His

GGT GTC GGC TAC CGT CCA ACC GCT 
Gly Val Gly Tyr Arg Pro Thr Ala

GTT TGG GTG AAA CCA CAG GGT GAG 
Val Trp Val Lys Pro Gln Gly Glu

GAG ATC GAT CCT AAC GAC CCG AAC 

Glu Ile Asp Pro Asn Asp Pro Asn

TTC GCC AGC AAC TCG TCC AAC ACT 

Phe Ala Ser Asn Ser Ser Asn Ser

GCT CCG CAC GCT GGT CGC TGG TTC 

Ala Pro His Ala Gly Arg Trp Phe



CCC GGC AAC TCG TGT AAC CAG CCT GGT 
Arg Gly Asn Trp Cys Asn Glu Pro Gly

GCA CCT TCT CCA GGT ATT GAT GCC TAC 
Ala Pro Ser Pro Gly Ile Asp Ala Tyr

TCT GAC GGT GTT TCC GAT CCT AAC TTC 
Ser Asp Gly Val Ser Asp Pro Asn Phe

AAA CAG CAC GAC CCA ATG TGT GAT CCG 
Lys Gln His Asp Pro Met Cys Asp Pro

GCA TAC GGC ACC GGC GCT ATG CCA AAT 

Ala Tyr Gly Thr Gly Ala Met Pro Asn

CCT GAA GCC TTC CAG TTA CTG CTT GAA 

Pro Glu Ala Phe Gln Leu Leu Leu Glu



AAC GCT TAC CCA CCA ATT AAC TAA 3032 
Asn Ala Tyr Pro Pro He Asn END 
ATG AAC AAG AAG TCC TCC AAA GAA GCC GTG GTG TAT CAA GTC TAG CCG CGG

Met Asn Lys Lys Trp Trp Lys Glu Ala Val Val Tyr Gln Val Tyr Pro Arg

AGC TTC AAA GAC AGC AAT GGA GAT GGT GTA GGC GAT CTG CCT GGG GTT ATT
Ser Phe Lys Asp Ser Asn Gly Asp Gly Val Gly Asp Leu Pro Gly Val Ile

GAA AAG CTT GAT TAC ATC AAA AGC CTT GGG GTG GAT GTT ATC TGG CTA TGC
Glu Lys Leu Asp Tyr Ile Lys Ser Leu Gly Val Asp Val Ile Trp Leu Cys

CCG GTG TAC GAT TCC CCC AAT GAT GAC AAT GGT TAC GAT ATT CGT GAC TAC
Pro Val Tyr Asp Ser Pro Asn Asp Asp Asn Gly Tyr Asp Ile Arg Asp Tyr

TAC GAT ATC ATG GCT GAT TTC GCC ACG ATC GCT GAT TTT GAT GAG CTG CTC
Tyr Asp Ile Met Ala Asp Phe Gly Thr Met Ala Asp Phe Asp Glu Leu Leu

GAC GGA ATA CAT CAG CCT GGG ATG AAA CTG CTA ATC GAC CTG GTG GTA AAC
Glu Gly Ile His Gln Arg Gly Met Lys Leu Leu Met Asp Leu Val Val Asn

CAC TGC TCT GAT GAG CAC AAA TGG TTT CAG GAG TCC CGC AAG ACT AAA GAC
His Cys Ser Asp Glu His Lys Trp Phe Gln Glu Ser Arg Lys Ser Lys Asp



AAC CCT TAC CGG GAC TAC TTC ATC TGG AAG CCT GCC AAA AAC GGA GGC CCA
Asn Pro Tyr Arg Asp Tyr Phe Ile Trp Lys Pro Gly Lys Asn Gly Gly Pro

CCT AAC AAC TGG CAG TCC TTT TTT AGT GGT AAT GCC TGG GAA TAC GAT GAG
Pro Asn Asn Trp Gln Ser Phe Phe Ser Gly Asn Ala Trp Glu Tyr Asp Glu

GCC ACT GAC GAG TAT TAC CTA CAT CTT TTC ACC AAA AAG CAA CCA GAC CTC
Ala Thr Asp Glu Tyr Tyr Leu His Leu Phe Thr Lys Lys Gln Pro Asp Leu

AAT TGG GAA AAC CCG AAA GTA CGT GAG GAG GTG CAC AAG CTG ATG AAG TAT
Asn Trp Glu Asn Pro Lys Val Arg Glu Glu Val His Lys Leu Met Lys Tyr

TGG CTG GAC AAA GGA GTA GAT GGG TTC CGG ATG GAT GTG ATT TCC GTG ATT
Trp Leu Asp Lys Gly Val Asp Gly Phe Arg Met Asp Val Ile Ser Val Ile

TCA AAA AGA AAC TTC GAA GAT TCA CCT TAC AAG GAC TTC AAC AAG ACC ATC
Ser Lys Arg Asn Phe Glu Asp Ser Pro Tyr Lys Asp Phe Asn Lys Thr Ile

GAT AAC GTC TAC GCC AAT GGC CCG CGT GTG CAG GAG TTT CTC CAG GAA ATG
Asp Asn Val Tyr Ala Asn Gly Pro Arg Val Gln Glu Phe Leu Gln Glu Met

AAC CGT GAA GTA CTG AGT AAG TAC GAT GTG ATG ACA GTA GGT GAG GGT CCA
Asn Arg Glu Val Leu Ser Lys Tyr Asp Val Met Thr Val Gly Glu Gly Pro



GGT ATC AAT CTG GAA AGC GGC CTC
Gly Ile Asn Leu Glu Ser Gly Leu

CTT AAT ATG ATT TTT CAT TTT GGG
Leu Asn Met Ile Phe His Phe Gly

GGT AGA TTT GAT CCC AAG CCC ATC
Gly Arg Phe Asp Pro Lys Pro Ile

AGG CTG TGG GAT GAG TAC CTT AAA
Arg Leu Trp Asp Glu Tyr Leu Lys

GGG AAT CAT GAT TTT CAG CGA ATC
Gly Asn His Asp Phe Gln Arg Ile

TAC TGG AAA GAG TCC GCC AAA CTG
Tyr Trp Lys Glu Ser Ala Lys Leu

GGC ACG GTC TAC GTT TAC CAG GGT
Gly Thr Val Tyr Val Tyr Gln Gly



CAA TAT GTA TCC AGC TCA GCG GAG GCT 

Gin Tyr Val Ser Ser Ser Ala Glu Ala 

CAC ATG TTT ATG GAT CAT GGA CCC GGA
His Met Phe Met Asp His Gly Pro Gly

GAT TTT CTG GAA TTC AAA AAA GTC TTC
Asp Phe Leu Glu Phe Lys Lys Val Phe

GAA GAG GGC TGG GGT AGC GTC TTT CTA
Glu Glu Gly Trp Gly Ser Val Phe Leu

GTT TCT CGC TTT GGG GAT GAC GGA GCG
Val Ser Arg Phe Gly Asp Asp Gly Ala

CTG AGC TTG TTG CTA TTT AGC ATG CGC
Leu Ser Leu Leu Leu Phe Ser Met Arg

GAT GAA ATA GGT ATG ACC AAT GTG GCT
Asp Glu Ile Gly Met Thr Asn Val Ala



TTT GAC ACC ATA GAA GAA TAT GAC GAT GTG GAG ATC AAA AAT GCT TAC AAG 



Phe Asp Thr Ile Glu Glu Tyr Asp

GAG TGG AAA GCT GAA GGA AAA GAC 
Glu Trp Lys Ala Glu Gly Lys Asp 

ATC AAT GGC CGT GAC AAT GCC CGT
Ile Asn Gly Arg Asp Asn Ala Arg

CAG GCT GGT TTT ACC TCA GGC ACT
Gln Ala Gly Phe Thr Ser Gly Thr

ACQ GCA ATC AAT GTG GCT AGT CAG
Thr Ala Ile Asn Val Ala Ser Gln

TTT TAT CGC CGG ATG GTG GCG ATG
Phe Tyr Arg Arg Met Val Ala Met

GGT GAT TTT GCC CCC ATT CAG GAA
Gly Asp Phe Ala Pro Ile Gln Glu



Asp Val Glu lie Lys Asn Ala Tyr Lys 

CTG GAT CAG TTT TTA AAG AAC GTC CAT 
Leu Asp Gln Phe Leu Lys Asn Val His

ACA CCG CTG CAA TGG AAT GAT GCT GAG
Thr Pro Leu Gln Trp Asn Asp Ala Glu

CCA TGG CTC AAA GTC AAC CCT AAC TAT
Pro Trp Leu Lys Val Asn Pro Asn Tyr

GAA GGA GAT GAG AAC TCT ATT CTG GCA
Glu Gly Asp Glu Asn Ser Ile Leu Ala

CGA AAG GAG CAC CCG ACA CTT GTT TAT
Arg Lys Glu His Pro Thr Leu Val Tyr

GAT CAT CCG AGT GTA TTT GCT TTT TGG
Asp His Pro Ser Val Phe Ala Phe Trp



AGA TGG GAT GAA GAG GCT GCA TAT TTA GTC TTA CTC AAT TTT TCT GAG GAG 
Arg Trp Asp Glu Glu Ala Ala Tyr Leu Val Leu Leu Asn Phe Ser Glu Glu



ACT CAG GAA TTT GGC CTG GAC GAT CGA 
Thr Gln Glu Phe Gly Leu Asp Asp Arg

GTA GAG GCC AAT GAC TTT GAC TTT GGT
Val Glu Ala Asn Asp Phe Asp Phe Gly

CTA AAA CCG TGG CAG GCG GTG TTG GCG
Leu Lys Pro Trp Gln Ala Val Leu Ala



TTT GAT AGT ACT AAG CTT CGC ATA 

Phe Asp Ser Ser Lys Leu Arg Ile

GAG CCA CAA AGT GGA AAA GTG AAA
Glu Pro Gln Ser Gly Lys Val Lys

iss: 

CGT GTT CGG CAT ATT GAA TTG TAA 
Arg Val Arg His Ile Glu Leu END
TCT TCT GAA CCA TTC TCC ACT GAG CAG AAA AGA CCA GAT CAT ACT CTT TGT
Ser Ser Glu Arg Phe Ser Thr Glu Gln Lys Arg Pro Asp His Thr Leu Cys

GGA CGG AAA AGA ACA TTC GCC AAA GAA GGT GGT TAT ACC ACC CTT CAA AGA
Gly Arg Lys Arg Thr Phe Gly Lys Glu Gly Gly Tyr Thr Thr Leu Gln Arg

GGA AAC GCT GGT CTT CAA ACT GAA CGG ACT GAA GAG GGG AGA GCA CCT CGT
Gly Asn Ala Gly Leu Gln Ser Glu Arg Thr Glu Glu Gly Arg Ala Pro Arg

ATC CAC CAG TCT GAA CAC GGG AAA AAC CAT CTA TGT GAG GTG ATC TGT GTG
Ile His Gln Ser Glu His Gly Lys Asn His Leu Cys Glu Val Ile Cys Val

GAG ATC TTC AAA AGA CCG TTC AGA GAA GGG AGC TTC GTT CTG AAA GAG AAG
Glu Ile Phe Lys Arg Pro Phe Arg Glu Gly Ser Phe Val Leu Lys Glu Lys

GAC TAC ACC GTT GAG TTC GAG GTG GAG AAG ATC CAT CTT GGA TGG AAG ATT
Asp Tyr Thr Val Glu Phe Glu Val Glu Lys Ile His Leu Gly Trp Lys Ile

TCA GGG AGA GTG AAG GGA AAT CCC GGA AGG CTT GAG ATC TTT CGG ACA AAC
Ser Gly Arg Val Lys Gly Asn Pro Gly Arg Leu Glu Ile Phe Arg Thr Asn



GCA CCG AAG AAA CTC CTC GTG AAC 
Ala Pro Lys Lys Leu Leu Val Asn 

GTG GTG GAT CTT CCA TCC TTC ACC
Val Val Asp Leu Pro Ser Phe Thr

TAC ACG GCC TCT GTG GTA CCG GAT
Tyr Thr Ala Ser Val Val Pro Asp

TAC TTC GTG GCA GAG GAA GGG AGA
Tyr Phe Val Ala Glu Glu Gly Arg

GCA CAT CCT TTC TTT GCG GCA GAG
Ala His Pro Phe Phe Ala Ala Glu

TAC TTC GAT GTG AAT TTC GAT GAC
Tyr Phe Asp Val Asn Phe Asp Asp

CTT GAA AAT CCA ATC ACC TCT CTC
Leu Glu Asn Pro Ile Thr Ser Leu

GGG AAG GAA AAC AGC GCG AGG ATT
Gly Lys Glu Asn Ser Ala Arg Ile



AAC TGG CAG TCC TGG GGA CCC TGC AGG 
Asn Trp Gln Ser Trp Gly Pro Cys Arg

CCA CCC GAG ATA GAT CCA AAC TGG CAG
Pro Pro Glu Ile Asp Pro Asn Trp Gln

GTG ATC AAA AAC CGT CTT CAG AGT GAC
Val Ile Lys Asn Arg Leu Gln Ser Asp

GTA TAC GGT TTT TTG AGT TCG AAG ATC
Val Tyr Gly Phe Leu Ser Ser Lys Ile

AAT GGA GAA CTT GTT GCG TAT CTT GAG
Asn Gly Glu Leu Val Ala Tyr Leu Glu

TTC GTC CCG ATA GAA CCT TTT GTC GTC
Phe Val Pro Ile Glu Pro Phe Val Val

CTT CTG GAA AAG TAC GCT GAA CTC GTC
Leu Leu Glu Lys Tyr Ala Glu Leu Val

CCA AAA CGT ACA CCG GTT GGA TGG TGC 
Pro Lys Arg Thr Pro Val Gly Trp Cys 



AGC TGG TAC CAC TAT TTC CTC GAT CTC ACC TGG GAG GAG ACT TTG AAG AAT 
Ser Trp Tyr His Tyr Phe Leu Asp Leu Thr Trp Glu Glu Thr Leu Lys Asn

CTG GAA CTT GCA GGA GAG TTT CCC TTC GAG GTC TTT CAG ATA GAC GAC GCG
Leu Glu Leu Ala Gly Glu Phe Pro Phe Glu Val Phe Gln Ile Asp Asp Ala

TAT GAA AAA GAC ATC GGA GAC TGG CTC GTC ACG AAG AAA GAC TTC CCA TCT
Tyr Glu Lys Asp Ile Gly Asp Trp Leu Val Thr Lys Lys Asp Phe Pro Ser

GTG GAC GAG ATG GCA AGG ACG ATA CAG GAG AAA GGC TTT GTT CCT GGT ATA
Val Asp Glu Met Ala Arg Thr Ile Gln Glu Lys Gly Phe Val Pro Gly Ile

TGG ACC GCA CCG TTC AGT GTT TCA GAA ACA TCG GAT GTG TTC AAC TCC TAT
Trp Thr Ala Pro Phe Ser Val Ser Glu Thr Ser Asp Val Phe Asn Ser Tyr

CCG GAC TGG GTC GTG AAG GAA AAC GGA ATG CCA AAG ATG GCG TAC AGG AAC
Pro Asp Trp Val Val Lys Glu Asn Gly Met Pro Lys Met Ala Tyr Arg Asn

TGG AAC AGA AAG ATC TAC GCT CTT GAC CTT TCA AAC AAA GAA GTC CTG GAC
Trp Asn Arg Lys Ile Tyr Ala Leu Asp Leu Ser Asn Lys Glu Val Leu Asp

TGG CTC TTC GAC CTC TTC AGC TCT CTC AAG AAG ATG GGC TAC AGA TAC TTC 



Trp Leu Phe Asp Leu Phe Ser Ser Leu

AAG ATC GAC TTT CTC TTT GCA GGA GCG
Lys Ile Asp Phe Leu Phe Ala Gly Ala

ATC ACA CCC GTT CAG GCG TTC AGA AAG
Ile Thr Pro Val Gln Ala Phe Arg Lys

GTT GGA GAC TTG TTC ACA CTC GGA TGT
Val Gly Asp Leu Phe Ile Leu Gly Cys

GGC TAC GTT GAC GGC ATG AGG ATA GGG
Gly Tyr Val Asp Gly Met Arg Ile Gly

GAT CAA ATA GAA GAC AAC GGA GCA CCC
Asp Gln Ile Glu Asp Asn Gly Ala Pro

GCC ATC ACA CGT TAC TTC ATG CAC GAC
Ala Ile Thr Arg Tyr Phe Met His Asp

TGC CTC ATC CTG AGA GAG GAA AAA ACA
Cys Leu Ile Leu Arg Glu Glu Lys Thr



Lys Lys Met Gly Tyr Arg Tyr Phe

ATT CCG GGT GAG AGG AAA GAA AAC
Ile Pro Gly Glu Arg Lys Glu Asn

GGG ATG GAG GTG ATC AGA AAG GCG
Gly Met Glu Val Ile Arg Lys Ala

GGC TCT CCC CTT CTT CCT GCG GTG
Gly Ser Pro Leu Leu Pro Ala Val

CCG GAC ACC ACA CCC TTC TGG GGT
Pro Asp Thr Thr Pro Phe Trp Gly

GCT GCA AGA TGG GCT CTG AGA AAT
Ala Ala Arg Trp Ala Leu Arg Asn

AGA CTC TGG CTG AAC GAT CCG GAC
Arg Leu Trp Leu Asn Asp Pro Asp

GAA CTG ACC CCA AAA GAG AGA GAG
Glu Leu Thr Pro Lys Glu Arg Glu



CTC TAC TCG TAC ACC TGT GGG ATC 
Leu Tyr Ser Tyr Thr Cys Gly Ile

GAC CTG TCA CTT GTG AAA GAG CAC
Asp Leu Ser Leu Val Lys Glu His

GAT CTT CTC GGG GGA AAG CCC CGT
Asp Leu Leu Gly Gly Lys Pro Arg

AAG TAC GAG ATC GTC TCG TCT GGC
Lys Tyr Glu Ile Val Ser Ser Gly

GTC GAT CTC AAA AAC AGA GAG TAC
Val Asp Leu Lys Asn Arg Glu Tyr

CTG AGA AAG AAG GTT GTC AAA AGA
Leu Arg Lys Lys Val Val Lys Arg

GAA GAG GGT GAG AGA GAA TGA
Glu Glu Gly Glu Arg Glu END



CTC GAC AAC ATG ATC ATA GAA ACT GAC
Leu Asp Asn Met Ile Ile Glu Ser Asp

GGA AGG AAG GTT CTG AGA GAG ACA CTC
Gly Arg Lys Val Leu Arg Glu Thr Leu

GTT CTG AAC ATC ATG ACA GAG GAT CTG
Val Leu Asn Ile Met Thr Glu Asp Leu

ACG ATC TCT GGA AAC ACC AGG CTC GTT
Thr Ile Ser Gly Asn Thr Arg Leu Val

CAT CTG GAA AAA GAG GGA AAG TCC TCT
His Leu Glu Lys Glu Gly Lys Ser Ser

GAA GAC GGA AGA AAC TTC TAC TTC TAC
Glu Asp Gly Arg Asn Phe Tyr Phe Tyr

lass 
ATG ACA AAA CTT GTC TTC TCA TTT TTG ATT GTG ACA TTG CCC A" GTC CTC
Met Arg Lys Leu Val Phe Ser Phe Leu Ile Val Thr Leu Pro Ile Val Leu

TTT GCA AAC ACT GAT TTC GTG AAA GTG GAA AAC GGC AGG TTC ATA CTC AAC
Phe Ala Asn Ser Asp Phe Val Lys Val Glu Asn Gly Arg Phe Ile Leu Asn

GGA GAA GAG TTC AGA TTC GTT GGA AGC AAC AAC TAC TAC ATG CAC TAC AAC
Gly Glu Glu Phe Arg Phe Val Gly Ser Asn Asn Tyr Tyr Met His Tyr Lys

AGC AAT CGA ATG ATA GAC AGT GTC CTT GAA AGT GCA AAA GCC ATG GGG GTG
Ser Asn Arg Met Ile Asp Ser Val Leu Glu Ser Ala Lys Ala Met Gly Val

AAG GTG CTC AGA ATT TGG GGA TTC CTC GAT GGT GAG AGT TAC T3= C3T GAC
Lys Val Leu Arg Ile Trp Gly Phe Leu Asp Gly Glu Ser Tyr Cys Arg Asp

AAG AAC ACC TAC ATG CAC CCC GCA CCG GGA GTA TTT GGA TTG CCA GAG GGT
Lys Asn Thr Tyr Met His Pro Ala Pro Gly Val Phe Gly Leu Pro Glu Gly

ACG AAC GCT CAG GAC GGT TTT GAA AGA CTC GAC TAC ACG GTA G~ AAA GCA
Thr Asn Ala Gln Asp Gly Phe Glu Arg Leu Asp Tyr Thr Val Ala Lys Ala



AAA GAA CTG GGC ATA AAG CTC ATA 
Lys Glu Leu Gly Ile Lys Leu Ile

TTC GGT GGA ATG AAT CAA TAC GTG
Phe Gly Gly Met Asn Gln Tyr Val

GAC TTC TAC AGG AAC GAG AAG ATC
Asp Phe Tyr Arg Asn Glu Lys Ile

TTC CTC ATA AAC AGG GTG AAC ACC
Phe Leu Ile Asn Arg Val Asn Thr

CCC ACC ATC ATG GCA TGG GAA CTG
Pro Thr Ile Met Ala Trp Glu Leu

AAG TCT GGT AAC ACA CTC GTT GAA
Lys Ser Gly Asn Thr Leu Val Glu

AAG AGT CTG GAT CCA AAC CAC CTG
Lys Ser Leu Asp Pro Asn His Leu

AAC AAC TAC GAA GGC TTC AGA CCT
Asn Asn Tyr Glu Gly Phe Arg Pro



ATC GTT CTT GTG AAC AAC TGG GAC GAC 
Ile Val Leu Val Asn Asn Trp Asp Asp

AGA TGG TTT GGG GGC ATC CAT CAC GAT
Arg Trp Phe Gly Gly Ile His His Asp

AAA GAA GAA TAC AAA AAG TAC GTG TCT
Lys Glu Glu Tyr Lys Lys Tyr Val Ser

TAC ACG GGT GTT CCT TAC AGG GAA GAG
Tyr Thr Gly Val Pro Tyr Arg Glu Glu

GCG AAC GAG CCC AGG TGT GAA ACG GAC
Ala Asn Glu Pro Arg Cys Glu Thr Asp

TGG GTA GAG GAG ATG AGT GCT TAC ATA
Trp Val Glu Glu Met Ser Ala Tyr Ile

GTT CCC GTG GGA GAC GAG GGA TTC TTC 
Val Ala Val Gly Asp Glu Gly Phe Phe 

TAC GGT GGA GAG GCT GAG TGG GCC TAC 
Tyr Gly Gly Glu Ala Glu Trp Ala Tyr 



AAC GGA TGG TCC GGT GTT GAC TGG
Asn Gly Trp Ser Gly Val Asp Trp

GAT TTT GGT ACG TTC CAT CTC TAC
Asp Phe Gly Thr Phe His Leu Tyr

AAC TAC GCA CAG TGG GGG GCA AAG
Asn Tyr Ala Gln Trp Gly Ala Lys

AAA GAG GTT GGA AAA CCC GTC GTT
Lys Glu Val Gly Lys Pro Val Val

GCC CCG GTC AAC AGG GTT GCC ATT
Ala Pro Val Asn Arg Val Ala His

AAC CTC GGT GGA AAC GGT GCC ATG
Asn Leu Gly Gly Asn Gly Ala Met

GGA TGG GAC AGA GAC GAA AAC GGT
Gly Trp Asp Arg Asp Glu Lys Gly

ATA GTG AAC GAT GAA ACT GAA GAG 



AAG AGA CTT CTG GAG ATA GAG ACG GTG 
Lys Arg Leu Leu Glu Ile Glu Thr Val

CCC TCC CAC TGG GGT GTG AGC CCT GAA
Pro Ser His Trp Gly Val Ser Pro Glu

TGG ATA GAA GAT CAC ATA AAG ATC GCA
Trp Ile Glu Asp His Ile Lys Ile Ala

CTG GAA GAG TAC GGT ATT CCC AAA ACT
Leu Glu Glu Tyr Gly Ile Pro Lys Ser

TAC AAA TTG TGG AAC GAT CTG GTC TAC
Tyr Lys Leu Trp Asn Asp Leu Val Tyr

TTC TGG ATG CTC GCA GGA ATC GGT GAA
Phe Trp Met Leu Ala Gly Ile Gly Glu

TAC TAC CCC GAT TAC GAC GGC TTC AGA 
Tyr Tyr Pro Asp Tyr Asp Gly Phe Arg 

GCA AAG TTG ATC AGA GAG TAC GCG AAA 



Ile Val Asn Asp Glu Ser Glu Glu Ala

CTG TTC AGC ACG GGT GAG GAT ACG AGG
Leu Phe Ser Thr Gly Glu Asp Thr Arg

CCA AAG GAT GGT CAG GAG ATC AAA AAG
Pro Lys Asp Gly Gln Glu Ile Lys Lys

TTC GAC TAC AGC AAC ACG TTC AAA GGA
Phe Asp Tyr Ser Asn Thr Phe Lys Gly

CTC TTT GAA GAT GAG ATA AAA CAT CTC
Leu Phe Glu Asp Glu Ile Lys His Leu

TTT GAC ACA ACG CGG ATT TCA GAC GGA
Phe Asp Thr Thr Arg Ile Ser Asp Gly

CAT TTC AGG GGA GAA ACG GTG AAA GAC 
His Phe Arg Gly Glu Thr Val Lys Asp 



Lys Leu Ile Arg Glu Tyr Ala Lys

GAA GAT ACC TGC ATC TTC ATC ACA
Glu Asp Thr Cys Met Phe Ile Thr

ACT GTG AAG GTG AGA GTG GGT GTC
Thr Val Lys Val Arg Val Gly Val

ATT TCC GTC GGG GTT GAA AAT CTG
Ile Ser Val Gly Val Glu Asn Leu

GGA TAT GGA GTT TAC GGA TTC GAA
Gly Tyr Gly Val Tyr Gly Phe Glu

GAA CAC GAG ATG TTC CTT GAG GCA
Glu His Glu Met Phe Leu Glu Ala

ACA ATC AGG GTG AAA GTT GTG AAC
Thr Ile Arg Val Lys Val Val Asn



AGA GCG CAG 
Arg Ala Gin 



TAT GTA CTC GCA GAA GAA GTG GAT TTT TCC AGA CCC GAA GAA 
Tyr Val Leu Ala Glu Glu Val Asp Phe Ser Arg Pro Glu Glu



GTC AAG AAC TGG TGG AAC AGC GGA 
Val Lys Asn Trp Trp Asn Ser Gly

GAT ATA GAG TGG AAC GGT GAG GTG
Asp Ile Glu Trp Asn Gly Glu Val

GTG CTT CCC GGA AAG GGT GAC TGG
Val Leu Pro Gly Lys Gly Asp Trp

GAT CAA CTC CCC GTG TGT GAG ATC
Asp Gln Leu Pro Val Cys Glu Ile

GTT GAA GGG CTT ACA GGA AGG CTC
Val Glu Gly Leu Thr Gly Arg Leu

TGG GTG AAG ATA GGG CTC GAC ATG
Trp Val Lys Ile Gly Leu Asp Met

CTT GTC AGT TTC GAT GGC AAA AAG
Leu Val Ser Phe Asp Gly Lys Lys

TTC GAC AAG ACA CCT GGA GTG AAC 

Phe Asp Lys Thr Pro Gly Val Asn 



ACA TGG CAG GCT GAG TTC AAA ACA CCC 
Thr Trp Gln Ala Glu Phe Lys Thr Pro

GGG AAC GGT GCT CTC CAG ATG AAC GTG
Gly Asn Gly Ala Leu Gln Met Asn Val

GAA GAG GTG AGG GTC GTC AGG AAA TTC
Glu Glu Val Arg Val Val Arg Lys Phe

CTC GAG TAC GAT ATC TAC ATA CCA GAC
Leu Glu Tyr Asp Ile Tyr Ile Pro Asp

AGA CCG TAC GCG GTG CTG AAT CCC GGC
Arg Pro Tyr Ala Val Leu Asn Pro Gly

AAC AAC ACC TCG ATT GAC AGC GGA GAA
Asn Asn Thr Ser Ile Asp Ser Gly Glu

TAC AGA AAG TTC CAT GTG AGG ATC GAG
Tyr Arg Lys Phe His Val Arg Ile Glu

GAG CTC CAC ATA GGT GTA GTT GGA GAC
Glu Leu His Ile Gly Val Val Gly Asp



CAC CTG GAG TAT GAT GGG CCG ATT TTC ATC GAT AAT GTG AGG CTC TAT AAA 
His Leu Glu Tyr Asp Gly Pro Ile Phe Ile Asp Asn Val Arg Leu Tyr Lys

AAA TCT TCT TGA 2000
Lys Ser Ser END
ATG CAT TTT AGC CCA CTA CAA TTG ATC CTC GTC TTA GTC ATT GTC ATT CTG
Met His Phe Ser Pro Leu Gln Leu Ile Leu Val Leu Val Ile Val Ile Leu

CTG TTT GGC ACC AAA AAA TTA CGC AAT ATG GGC GGC GAT TTA GGC GAA GCC
Leu Phe Gly Thr Lys Lys Leu Arg Asn Met Gly Gly Asp Leu Gly Glu Ala

TTC AAG AAT TTC AGA AAA CCA GTC AAA GAC GGC GAT GAT GCT GAA ACA CAA
Phe Lys Asn Phe Arg Lys Ala Val Lys Asp Gly Asp Asp Ala Glu Thr Gln

AAA GAT GTT GCT GTG CAA AAA GTT GAC CAA CAG CCA CCA GCA CAG CCC ATC
Lys Asp Val Ala Val Gln Lys Val Asp Gln Gln Pro Pro Ala Gln Pro Ile

25-1

CCA CAA GGT CGA GTC ATT GAT TCG GAA GCC AAG GAA AAG GAT AAG GTC TAA
Pro Gln Gly Arg Val Ile Asp Ser Glu Ala Lys Glu Lys Asp Lys Val END
ATG GAA CCA CTT CCA GGA GGT GTG AGG ATG AAG TTC CCA TCT AAC TTT CTT
Met Glu Gly Leu Arg Gly Gly Val Arg Met Lys Phe Pro Ser Asn Phe Leu

TTT GGC TAC TCC TGG TCG GGC TTC CAG TTT GAA ATG GGT TTA CCT GGG AGT
Phe Gly Tyr Ser Trp Ser Gly Phe Gln Phe Glu Met Gly Leu Pro Gly Ser

GAA GTT GAG AGC GAC TGG TGG GCA TGG GTC CAC GAT AAG GAG AAC ATC TTC
Glu Val Glu Ser Asp Trp Trp Ala Trp Val His Asp Lys Glu Asn Ile Phe

TCG GGC CTA GTT AGC GGT GAC CTA CCA GAG AAC GGG CCT GCT TAC TGG CAC
Ser Gly Leu Val Ser Gly Asp Leu Pro Glu Asn Gly Pro Ala Tyr Trp His

CTC TAC AAG AAA GAC CAC GAC ATA GCT GAA AGC CTT GGC ATG GAC GCG ATA
Leu Tyr Lys Lys Asp His Asp Ile Ala Glu Ser Leu Gly Met Asp Ala Ile

AGA GGC GGA ATC GAG TGG GCG AGG ATC TTC CCA AAA CCC ACC TTT GAC GTG
Arg Gly Gly Ile Glu Trp Ala Arg Ile Phe Pro Lys Pro Thr Phe Asp Val

AAG GTT GAC GTG GAA AAG GAC GAA AAC GGG AAC ATA ATC TCC ATT GAC GTC
Lys Val Asp Val Glu Lys Asp Glu Asn Gly Asn Ile Ile Ser Ile Asp Val



CCG GAG AGC GCG ATA GAG GAG CTA 
Pro Glu Ser Ala Ile Glu Glu Leu

AAC CAC TAC CGC GAA ATC TAC TCG
Asn His Tyr Arg Glu Ile Tyr Ser

ATA TTG AAC CTC TAT CAC TGG CCC
Ile Leu Asn Leu Tyr His Trp Pro

GGC GTT AGA AAG CTC GGC CCT GAT
Gly Val Arg Lys Leu Gly Pro Asp

AGG AGC GTG GTG GAG TTC ACC AAG
Arg Ser Val Val Glu Phe Thr Lys

GAT GAC CTC GTT GAC ATG TGG AGC
Asp Asp Leu Val Asp Met Trp Ser

GAG CAG GGT TAC ACG AGG CCT CAG
Glu Gln Gly Tyr Thr Arg Pro Gln

CAC GAG GCC GCT GGA AAG GCG AAG 
His Glu Ala Ala Gly Lys Ala Lys 



GAA AAG CTT CCC AAC ATG GAT GCC CTC 
Glu Lys Leu Ala Asn Met Asp Ala Leu 

GAC TGG AAG GAG AGG GGC AAG ACC TTC 
Asp Trp Lys Glu Arg Gly Lys Thr Phe

CTT CCC CTC TGG CTC CAC GAC CCG ATA
Leu Pro Leu Trp Leu His Asp Pro Ile

AGA GCT CCC TCG GGC TGG CTG GAC GAG
Arg Ala Pro Ser Gly Trp Leu Asp Glu

TTC GCT GCA TTC ATC GCC TAC CAC TTG
Phe Ala Ala Phe Ile Ala Tyr His Leu

ACG ATG AAC GAG CCG AAT GTG GTT TAC
Thr Met Asn Glu Pro Asn Val Val Tyr

TCG GGC TTT CCA CCG GGT TAT CTC AGC
Ser Gly Phe Pro Pro Gly Tyr Leu Ser

CTC AAC CTC ATG CAG GCT CAC GCT AGA
Leu Asn Leu Met Gln Ala His Ala Arg



GCT TAC GAT GCG ATA AAA GAG CAC TCG GAC AAG CCC GTG GGG TTG ATA TAC
Ala Tyr Asp Ala Ile Lys Glu His Ser Asp Lys Pro Val Gly Leu Ile Tyr

TCC TTT GTC TGG CAC GAT GCC CTA AAC GAG GAA GCG GAG GAG ATT GTG AAG
Ser Phe Val Trp His Asp Ala Leu Asn Glu Glu Ala Glu Glu Ile Val Lys

GAG ATA AGG AGG AGA CAC TAC GAC TTC GTA ACC GGC CTT CAC TCC GGC TCA
Glu Ile Arg Arg Arg His Tyr Asp Phe Val Thr Gly Leu His Ser Gly Ser

TCG GAG TTC GGG GAG AGG GAG GAC TTC AAG GGG AAG ATC GAC TGG ATA GGC
Ser Glu Phe Gly Glu Arg Glu Asp Phe Lys Gly Lys Ile Asp Trp Ile Gly

GTG AAC TAC TAC ACT AGG GTT GCT TAC GAG ATG AGG AAC GGC
Val Asn Tyr Tyr Thr Arg Val Ala Tyr Glu Met Arg Asn Gly

GCC CTA CCC GGG TAC GGC TAC ATG TGC GAG AGG AGT GGT TAC GCA AAA TCC
Ala Leu Pro Gly Tyr Gly Tyr Met Cys Glu Arg Ser Gly Tyr Ala Lys Ser

GGA AGG CCC GCG AGC GAT TTT GGC TGG GAG ACC TAT CCT GAG GGC CTC GAA
Gly Arg Pro Ala Ser Asp Phe Gly Trp Glu Thr Tyr Pro Glu Gly Leu Glu

AAC GTC TTG ATG GAT CTG AAG GAG CTC TAC GGC CTG CCA ATG ATG GTG ACG 



CGC TTT ATG
Arg Phe Met 



Asn Val Leu Met Asp Leu Lys Glu Leu Tyr Gly Leu Pro Met Met Val Thr



GAG AAC GGG ATG GCG GAT ATG GCA
Glu Asn Gly Met Ala Asp Met Ala

AGC CAC CTC GCG GCT ATC CAC AGG
Ser His Leu Ala Ala Ile His Arg

GGG TAC CTC CAC TGG TCT CTG ACC
Gly Tyr Leu His Trp Ser Leu Thr

AGA ATG CGC TTT GGG CTG GTG ATG
Arg Met Arg Phe Gly Leu Val Met

ATA AGG CCG AGC GCA CTC GTC TTC
Ile Arg Pro Ser Ala Leu Val Phe

CCC GAA GAG CTC TCC CAC CTA GCG
Pro Glu Glu Leu Ser His Leu Ala



GAC AGG CAC CGC TCT TAC TAC CTC GTG
Asp Arg His Arg Ser Tyr Tyr Leu Val

GCG ATG GAG AAG GGT GCC GAC GTT AGG
Ala Met Glu Lys Gly Ala Asp Val Arg

GAC AAC TAC GAG TGG GCG CAG GGC TTC
Asp Asn Tyr Glu Trp Ala Gln Gly Phe

GTG GAC TTC GAG ACT AAG AAG CGC TAC
Val Asp Phe Glu Thr Lys Lys Arg Tyr

AGG GAG ATA GCC ACG CAG AAG GAA ATA
Arg Glu Ile Ala Thr Gln Lys Glu Ile

1478

AAC CTC GAA CTG GTA ACG AAG AAG TAA
Asn Leu Glu Leu Val Thr Lys Lys END



AEPIIla (? a-V# S 3 GA4) ^Jai/M 

i 

ATG AAG TTC CCA TCT AAC TTT CTT TTT GCC TAC TCC TGG TCC GGC TT" CAC 
Met Lys Phe Pro Ser Asn Phe Leu Phe Gly Tyr Ser Trp Ser Gly Phe Gln

TTT GAA ATG GGT TTA CCT GGG AGT GAA GTT GAG AGC GAC TGG TGG CCA TCC
Phe Glu Met Gly Leu Pro Gly Ser Glu Val Glu Ser Asp Trp Trp Ala Trp

CTC CAC GAT AAG GAG AAC ATC TTC TCC GCC CTA CTT AGC C-CT CAC CTA CCA
Val His Asp Lys Glu Asn Ile Phe Ser Gly Leu Val Ser Gly Asp Leu Pro

GAG AAC GGG CCT GCT TAC TGG CAC CTC TAC AAG AAA GAC CAC GAC ATA GCT
Glu Asn Gly Pro Ala Tyr Trp His Leu Tyr Lys Lys Asp His Asp Ile Ala

GAA AGC CTT GGC ATC GAC CCG ATA ACA GGC GGA ATC GAC TCC GCC AGG ATC
Glu Ser Leu Gly Met Asp Ala Ile Arg Gly Gly Ile Glu Trp Ala Arg Ile

TTC CCA AAA CCC ACC TTT GAC GTG AAG GTT GAC GTG GAA AAG GAC GAA AAC
Phe Pro Lys Pro Thr Phe Asp Val Lys Val Asp Val Glu Lys Asp Glu Asn

GGG AAC ATA ATC TCC ATT GAC GTC CCG GAG AGC GCG ATA GAG GAG CTA GAA
Gly Asn Ile Ile Ser Ile Asp Val Pro Glu Ser Ala Ile Glu Glu Leu Glu



AAG CTT GCC AAC ATG GAT GCC CTC 
Lys Leu Ala Asn Met Asp Ala Leu

TGG AAG GAG AGG GGC AAG ACC TTC
Trp Lys Glu Arg Gly Lys Thr Phe

CCC CTC TGG CTC CAC GAC CCG ATA
Pro Leu Trp Leu His Asp Pro Ile

GCT CCC TCG GGC TGG CTG GAC GAG
Ala Pro Ser Gly Trp Leu Asp Glu

GCT GCA TTC ATC GCC TAC CAC TTG
Ala Ala Phe Ile Ala Tyr His Leu

ATG AAC GAG CCG AAT GTG GTT TAC
Met Asn Glu Pro Asn Val Val Tyr

GGC TTT CCA CCG GGT TAT CTC AGC
Gly Phe Pro Pro Gly Tyr Leu Ser

AAC CTC ATG CAG GCT CAC GCT AGA
Asn Leu Met Gln Ala His Ala Arg



AAC CAC TAC CGC GAA ATC TAC TCG GAC 
Asn His Tyr Arg Glu Ile Tyr Ser Asp

ATA TTG AAC CTC TAT CAC TGG CCC CTT
Ile Leu Asn Leu Tyr His Trp Pro Leu

GGC GTT AGA AAG CTC GGC CCT GAT AGA
Gly Val Arg Lys Leu Gly Pro Asp Arg

AGG AGC GTG GTG GAG TTC ACC AAG TTC
Arg Ser Val Val Glu Phe Thr Lys Phe

GAT GAC CTC GTT GAC ATG TGG AGC ACG
Asp Asp Leu Val Asp Met Trp Ser Thr

GAG CAG GGT TAC ACG AGG CCT CAG TCG
Glu Gln Gly Tyr Thr Arg Pro Gln Ser

CAC GAG GCC GCT GGA AAG GCG AAG CTC
His Glu Ala Ala Gly Lys Ala Lys Leu

GCT TAC GAT GCG ATA AAA GAG CAT TCG
Ala Tyr Asp Ala Ile Lys Glu His Ser



GAC AAG CCA GTT GGA GTT ATC TAC
Asp Lys Pro Val Gly Val Ile Tyr

GAA GCT GCA GAG GAA TCC GTT CTG
Glu Ala Ala Glu Glu Ser Val Leu

GTT GAT GGT CTC TAC TCA GGC AAG
Val Asp Gly Leu Tyr Ser Gly Lys

TTC AAA GGC AGG GTC GAC TGG GTT
Phe Lys Gly Arg Val Asp Trp Val

TTT GGA AAG GCC GGA GAT TCA GTG
Phe Gly Lys Ala Gly Asp Ser Val

TCC CCG AGG GGT GGC TAC GCC AAA
Ser Pro Arg Gly Gly Tyr Ala Lys

TGG GAG ATT TAT CCT GAG GGC CTC
Trp Glu Ile Tyr Pro Glu Gly Leu



GCA TAT AAG TGG ATT GAT GCG GAG GAT 
Ala Tyr Lys Trp Ile Asp Ala Glu Asp

GAA CTC CGC AGG AGG GAT TAC GAC TTC
Glu Leu Arg Arg Arg Asp Tyr Asp Phe

TCC CTG ACT GCA GGT GAG AGG GAG GAC
Ser Leu Thr Ala Gly Glu Arg Glu Asp

GGC GTC AAC TAC TAC TCC CGC CTG CTC
Gly Val Asn Tyr Tyr Ser Arg Leu Leu

AGA TTA CTT GAG GGC TAC GGT TTT GTC
Arg Leu Leu Glu Gly Tyr Gly Phe Val

TCG GGA AGG CCT GCG AGC GAT TTT GGC
Ser Gly Arg Pro Ala Ser Asp Phe Gly

GAA AAG CTC CTG GTT GAG CTG AGT GGC
Glu Lys Leu Leu Val Glu Leu Ser Gly



AGG TAC GAG CTT CCG CTC TTC ATA ACG GAG AAT GGT ATG GCT GAT GCT GTC 



Arg Tyr Glu Leu Pro Leu Phe Ile

GAT AGG TAC AGG CCT TAC TAC CTC
Asp Arg Tyr Arg Pro Tyr Tyr Leu

GCG ATC GAG AAG GGT GCC GAC ATT
Ala Met Glu Lys Gly Ala Asp Ile

GAC AAC TAC GAG TCG GCG CAG GGC
Asp Asn Tyr Glu Trp Ala Gln Gly

GTG GAC TTC GAG ACT AAG AAG CGC
Val Asp Phe Glu Thr Lys Lys Arg

AGG GAA ATA GCC ACG CGG AAG GAA
Arg Glu Ile Ala Thr Arg Lys Glu



Thr Glu Asn Gly Met Ala Asp Ala Val

GTG AGC CAC CTC GCG GCT ATC CAC AGG
Val Ser His Leu Ala Ala Ile His Arg

AGG GGG TAC CTC CAC TGG TCT CTG ACC
Arg Gly Tyr Leu His Trp Ser Leu Thr

TTC AGA ATG CGC TTT GGG CTG GTG ATG
Phe Arg Met Arg Phe Gly Leu Val Met

TAC TTG AGG CCG AGC GCA CTC GTC TTC
Tyr Leu Arg Pro Ser Ala Leu Val Phe

ATA CCC GAA GAG CTT GAA CAC CTT GCC
Ile Pro Glu Glu Leu Glu His Leu Ala



GAT GTG GAT GCA ATC ATT GCT CGG TGA 1454
Asp Val Asp Ala Ile Ile Ala Arg END
ATG CTA CCA GAA GAC TTC CTA TGG GGC GTT CGG CAG TCA GGC TTT CAG TTC
Met Leu Pro Glu Glu Phe Leu Trp Gly Val Gly Gln Ser Gly Phe Gln Phe

GAA ATG GGC GAC AAG CTC AGG AGG CAC ATC GAT CCA AAT A" GAG CCC TGG
Glu Met Gly Asp Lys Leu Arg Arg His Ile Asp Pro Asn Thr Asp Trp Trp

AAG TGG GTT CGC GAT CCT TTC AAC ATA AAA AAG GAG CTT GTG AGT GGG GAC
Lys Trp Val Arg Asp Pro Phe Asn Ile Lys Lys Glu Leu Val Ser Gly Asp

CTT CCC GAG GAC GGC ATC AAC AAC TAC GAA CTT TTT GAA AAC GAT CAC AAG
Leu Pro Glu Asp Gly Ile Asn Asn Tyr Glu Leu Phe Glu Asn Asp His Lys

CTC GCT AAA GGC CTT GGA CTC AAC GCA TAC AGG ATT GGA ATA GAG TGG AGC
Leu Ala Lys Gly Leu Gly Leu Asn Ala Tyr Arg Ile Gly Ile Glu Trp Ser

AGA ATC TTT CCC TGG CCG ACG TGG ACG GTC GAT ACC GAG GTC GAG TTC GAC
Arg Ile Phe Pro Trp Pro Thr Trp Thr Val Asp Thr Glu Val Glu Phe Asp

ACT TAC GGT TTA GTA AAG GAC GTT AAG ATA GAC AAG TCC ACC CTT GCT GAA
Thr Tyr Gly Leu Val Lys Asp Val Lys Ile Asp Lys Ser Thr Leu Ala Glu



CTC GAC AGG CTG GCC AAC AAG GAG GAG GTA ATG TAC TAC AGG CGC GTT ATT
Leu Asp Arg Leu Ala Asn Lys Glu Glu Val Met Tyr Tyr Arg Arg Val Ile

CAG CAT TTG AGG GAG CTC GGC TTC AAG GTC TTC GTT AAC CTC AAC CAC TTC
Gln His Leu Arg Glu Leu Gly Phe Lys Val Phe Val Asn Leu Asn His Phe

ACG CTT CCA ATA TGG CTC CAC GAC CCG ATA GTG GCA AGG GAG AAG GCC CTC
Thr Leu Pro Ile Trp Leu His Asp Pro Ile Val Ala Arg Glu Lys Ala Leu

ACA AAC GAC AGA ATC GGC TGG GTC TCC CAG AGG ACA GTT GTT GAG TTT GCC
Thr Asn Asp Arg Ile Gly Trp Val Ser Gln Arg Thr Val Val Glu Phe Ala

AAG TAT GCT GCT TAC ATC GCC CAT GCG CTC GGA GAC CTC GTG GAC ACA TGG
Lys Tyr Ala Ala Tyr Ile Ala His Ala Leu Gly Asp Leu Val Asp Thr Trp

AGC ACC TTC AAC GAA CTT ATG GTA GTT GTG GAG CTC GGC TAC CTC GCC CCC
Ser Thr Phe Asn Glu Pro Met Val Val Val Glu Leu Gly Tyr Leu Ala Pro

TAC TCA GGA TTT CCC CCG GGA GTC ATG AAC CCC GAG GCC GCG AAG CTG GCG
Tyr Ser Gly Phe Pro Pro Gly Val Met Asn Pro Glu Ala Ala Lys Leu Ala

ATC CTC AAC ATG ATA AAC GCC CAC GCC TTG GCA TAT AAG ATG ATA AAG AGG
Ile Leu Asn Met Ile Asn Ala His Ala Leu Ala Tyr Lys Met Ile Lys Arg



TTC GAC ACC AAG AAG GCC GAT GAG 

Phe Asp Thr Lys Lys Ala Asp Glu

ATA ATC TAC AAC AAC ATC GGT GTT
Ile Ile Tyr Asn Asn Ile Gly Val

AAG GAC GTT AAA GCA GCC GAA AAC
Lys Asp Val Lys Ala Ala Glu Asn

TTT GAT GCC ATC CAC AAG GGT AAG
Phe Asp Ala Ile His Lys Gly Lys

TTT GTA AAA GTT AGA CAC CTA AAA
Phe Val Lys Val Arg His Leu Lys

TAC ACC CGC GAG GTT GTT AGA TAT
Tyr Thr Arg Glu Val Val Arg Tyr

CTC ATA TCC TTC AAG GGC GTT CCC
Leu Ile Ser Phe Lys Gly Val Pro



GAT AGC AAG TCC CCT GCG GAC GTT GGC 
Asp Ser Lys Ser Pro Ala Asp Val Gly 

GCC TAC CCT AAA GAC CCT AAC GAT CCC 

Ala Tyr Pro Lys Asp Pro Asn Asp Pro

GAC AAC TAC TTC CAC AGC GGA CTG TTC
Asp Asn Tyr Phe His Ser Gly Leu Phe

CTC AAC ATA GAG TTC GAC GGC GAA AAC
Leu Asn Ile Glu Phe Asp Gly Glu Asn

GGC AAT GAC TGG ATA GGC CTC AAC TAC
Gly Asn Asp Trp Ile Gly Leu Asn Tyr

TCG GAG CCC AAG TTC CCA AGT ATA CCC
Ser Glu Pro Lys Phe Pro Ser Ile Pro

AAC TAC GGC TAC TCC TGC AGG CCC GGC
Asn Tyr Gly Tyr Ser Cys Arg Pro Gly



ACG ACC TCC GCC GAT GGC ATG CCC GTC AGC GAT ATC GGC TGG GAA GTC TAT 



Thr Thr Ser Ala Asp Gly Met Pro

CCG CAG GGA ATC TAC GAC TCG ATA
Pro Gln Gly Ile Tyr Asp Ser Ile

GTT TAC GTC ACC GAG AAC GGT GTT
Val Tyr Val Thr Glu Asn Gly Val

TAC TAC ATA GTC AGC CAC GTC TCA
Tyr Tyr Ile Val Ser His Val Ser

TAC CCC GTA AAA GGC TAC ATC TAC
Tyr Pro Val Lys Gly Tyr Met Tyr

GCC CTC GGC TTC AGC ATG AGG TTT
Ala Leu Gly Phe Ser Met Arg Phe

AAG GAG AGG ATC CCG AGG GAG AGA
Lys Glu Arg Ile Pro Arg Glu Arg

CAG TCC AAC GGT GTT CCT AAG GAT
Gln Ser Asn Gly Val Pro Lys Asp



Val Ser Asp Ile Gly Trp Glu Val Tyr

GTC GAG GCC ACC AAG TAC AGT GTT CCT
Val Glu Ala Thr Lys Tyr Ser Val Pro

GCG GAT TCC GCG GAC ACG CTG AGG CCA
Ala Asp Ser Ala Asp Thr Leu Arg Pro

AAG ATA GAG GAA GCC ATT GAG AAT GGA
Lys Ile Glu Glu Ala Ile Glu Asn Gly

TGG GCG CTT ACG GAT AAC TAC GAG TGG
Trp Ala Leu Thr Asp Asn Tyr Glu Trp

GGT CTC TAC AAG GTC GAC CTC ATC TCC
Gly Leu Tyr Lys Val Asp Leu Ile Ser

AGC GTT GAG ATA TAT CGC AGG ATA GTG
Ser Val Glu Ile Tyr Arg Arg Ile Val

ATC AAA GAG GAG TTC CTG AAG GGT GAG
Ile Lys Glu Glu Phe Leu Lys Gly Glu



GAG AAA TGA 1539
Glu Lys END
ATG CTA CCA GAA GAG TTC CTA TGG GGC GTT GGC CAG TCA GGC TTT CAG TTC

Met Leu Pro Glu Glu Phe Leu Trp Gly Val Gly Gln Ser Gly Phe Gln Phe

GAA ATG GGC GAC AAG CTC AGG AGG CAC ATC GAT CCA AAT ACC GAC TGG TGG
Glu Met Gly Asp Lys Leu Arg Arg His Ile Asp Pro Asn Thr Asp Trp Trp

AAG TGG GTT CGC GAT CCT TTC AAC ATA AAA AAG GAG CTT GTG AGT GGG GAC
Lys Trp Val Arg Asp Pro Phe Asn Ile Lys Lys Glu Leu Val Ser Gly Asp

CTT CCC GAG GAC GGC ATC AAC AAC TAC GAA CTT TTT GAA AAC GAT CAC AAG
Leu Pro Glu Asp Gly Ile Asn Asn Tyr Glu Leu Phe Glu Asn Asp His Lys

CTC GCT AAA GGC CTT GGA CTC AAC GCA TAC GGG ATT GGA ATA GAG TGG AGC
Leu Ala Lys Gly Leu Gly Leu Asn Ala Tyr Gly Ile Gly Ile Glu Trp Ser

AGA ATC TTT CCC TGG CCG ACG TGG ACG GTC GAT ACC GAG GTC GAG TTC GAC
Arg Ile Phe Pro Trp Pro Thr Trp Thr Val Asp Thr Glu Val Glu Phe Asp

ACT TAC GGT TTA GTA AAG GAC GTT AAG ATA GAC AAG TCC ACC CTT GCT GAA
Thr Tyr Gly Leu Val Lys Asp Val Lys Ile Asp Lys Ser Thr Leu Ala Glu



CTC GAC AGG CTC GCC AAC AAG GAG
Leu Asp Arg Leu Ala Asn Lys Glu

CAG CAT TTG AGG GAG CTC GGC TTC
Gln His Leu Arg Glu Leu Gly Phe

ACG CTT CCA ATA TGG CTC CAC GAC
Thr Leu Pro Ile Trp Leu His Asp

ACA AAC GAC AGA ATC GGC TGG GTC
Thr Asn Asp Arg Ile Gly Trp Val

AAG TAT GCT GCT TAC ATC GCC CAT
Lys Tyr Ala Ala Tyr Ile Ala His

AGC ACC TTC AAC GAA CCT ATG GTA
Ser Thr Phe Asn Glu Pro Met Val

TAC TCA GGA TTT CCC CCG GGA GTC
Tyr Ser Gly Phe Pro Pro Gly Val

ATC CTC AAC ATG ATA AAC GCC CAC
Ile Leu Asn Met Ile Asn Ala His



GAG GTA ATG TAC TAC AGG CGC GTT ATT 
Glu Val Met Tyr Tyr Arg Arg Val Ile

AAG GTC TTC GTT AAC CTC AAC CAC TTC
Lys Val Phe Val Asn Leu Asn His Phe

CCG ATA GTG GCA AGG GAG AAG GCC CTC
Pro Ile Val Ala Arg Glu Lys Ala Leu

TCC CAG AGG ACA GTT GTT GAG TTT GCC
Ser Gln Arg Thr Val Val Glu Phe Ala

GCG CTC GGA GAC CTC GTG GAC ACA TGG
Ala Leu Gly Asp Leu Val Asp Thr Trp

GTT GTG GAG CTC GGA TAC CTC GCC CCC
Val Val Glu Leu Gly Tyr Leu Ala Pro

ATG AAC CCC GAG GCC GCG AAG CTG GCG
Met Asn Pro Glu Ala Ala Lys Leu Ala

GCC TTG GCA TAT AAG ATG ATA AAG AGG
Ala Leu Ala Tyr Lys Met Ile Lys Arg



TTC GAC ACC AAG AAG GCC GAT GAG GAT AGC AAG TCC CCT GCG GAC GTT GGC 

Phe Asp Thr Lys Lys Ala Asp Glu Asp Ser Lys Ser Pro Ala Asp Val Gly

ATA ATC TAC AAC AAC ATC GGT GTT GCC TAC CCT AAA GAC CCT AAC GAT CCC
Ile Ile Tyr Asn Asn Ile Gly Val Ala Tyr Pro Lys Asp Pro Asn Asp Pro

AAG GAC GTT AAA GCA GCC GAA AAC GAC AAC TAC TTC CAC AGC GGA CTG TTC
Lys Asp Val Lys Ala Ala Glu Asn Asp Asn Tyr Phe His Ser Gly Leu Phe

TTT GAT GCC ATC CAC AAG GGT AAG CTC AAC ATA GAG TTC GAC GGC GAA AAC
Phe Asp Ala Ile His Lys Gly Lys Leu Asn Ile Glu Phe Asp Gly Glu Asn

TTT GTA AAA GTT AGA CAC CTA AAA GGC AAT GAC TGG ATA GGC CTC AAC TAC
Phe Val Lys Val Arg His Leu Lys Gly Asn Asp Trp Ile Gly Leu Asn Tyr

TAC ACC CGC GAG GTT GTT AGA TAT TCG GAG CCC AAG TTC CCA AGT ATA CCC
Tyr Thr Arg Glu Val Val Arg Tyr Ser Glu Pro Lys Phe Pro Ser Ile Pro

CTC ATA TCC TTC AAG GGC GTT CCC AAC TAC GGC TAC TCC TGC AGG CCC GGC
Leu Ile Ser Phe Lys Gly Val Pro Asn Tyr Gly Tyr Ser Cys Arg Pro Gly

ACG ACC TCC GCC GAT GGC ATG CCC GTC AGC GAT ATC GGC TGG GAA GTC TAT 



Thr Thr Ser Ala Asp Gly Met Pro Val

CCC CAG GGA ATC TAC GAC TCG ATA GTC
Pro Gln Gly Ile Tyr Asp Ser Ile Val

GTT TAC GTC ACC GAG AAC GGT GTT GCG
Val Tyr Val Thr Glu Asn Gly Val Ala

TAC TAC ATA GTC AGC CAC GTC TCA AAG
Tyr Tyr Ile Val Ser His Val Ser Lys

TAC CCC GTA AAA GGC TAC ATG TAC TGG
Tyr Pro Val Lys Gly Tyr Met Tyr Trp

GCC CTC GGC TTC AGC ATG AGG TTT GGT
Ala Leu Gly Phe Ser Met Arg Phe Gly

AAG GAG AGG ATC CCG AGG GAG AGA AGC
Lys Glu Arg Ile Pro Arg Glu Arg Ser

CAG TCC AAC GGT GTT CCT AAG GAT ATC
Gln Ser Asn Gly Val Pro Lys Asp Ile

GAG AAA TGA 

Glu Lys END 



Ser Asp Ile Gly Trp Glu Val Tyr

GAG GCC ACC AAG TAC AGT GTT CCT
Glu Ala Thr Lys Tyr Ser Val Pro

GAT TCC GCG GAC ACG CTG AGG CCA
Asp Ser Ala Asp Thr Leu Arg Pro

ATA GAG GAA GCC ATT GAG AAT GGA
Ile Glu Glu Ala Ile Glu Asn Gly

GCG CTT ACG GAT AAC TAC GAG TGG
Ala Leu Thr Asp Asn Tyr Glu Trp

CTC TAC AAG GTC GAC CTC ATC TCC
Leu Tyr Lys Val Asp Leu Ile Ser

GTT GAG ATA TAT CGC AGG ACA GTG
Val Glu Ile Tyr Arg Arg Thr Val

AAA GAG GAG TTC CTG AAG GGT GAG

Lys Glu Glu Phe Leu Lys Gly Glu

ATG CGT CCA TTC TTG TTA ATT TCT ATT TTG GAC TTT CGA GTT GCT GAC TAC

Met Arg Pro Phe Leu Leu Ile Ser Ile Leu Asp Phe Arg Val Ala Asp Tyr

CTC CAA CGT AAC ATA AAG ACA CAA AAC CAA TAT TGG GCA TTG TGC GTA GTA
Leu Gln Arg Asn Ile Lys Thr Gln Asn Gln Tyr Trp Ala Leu Cys Val Val

ATG TTC TCC AAT GTT CTT AGA TGG CAA AAC TTA AAT ATT TCA CCA GCG GTG
Met Phe Ser Asn Val Leu Arg Trp Gln Asn Leu Asn Ile Ser Pro Ala Val

ATA CAT AGA GAC ACC GCT GAA CAC AGA CGT GAT TCC ATG AAC AAG TTT GTC
Ile His Arg Asp Thr Ala Glu His Arg Gly Asp Ser Met Lys Lys Phe Val

GCC CTG TTC ATA ACC ATG TTT TTC GTA GTG AGC ATG GCA GTC GTT CCA CAG
Ala Leu Phe Ile Thr Met Phe Phe Val Val Ser Met Ala Val Val Ala Gln

CCA GCT AGC GCC GCA AAG TAT TCC GAG CTC GAA GAA GGC GGC GTT ATA ATG
Pro Ala Ser Ala Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val Ile Met

CAG GCC TTC TAC TGG GAC GTC CCA GGT GGA GGA ATC TGG TGG GAC ACC ATC
Gln Ala Phe Tyr Trp Asp Val Pro Gly Gly Gly Ile Trp Trp Asp Thr Ile



AGG AGC AAG ATA CCG GAG TGG TAC GAG 
Arg Ser Lys Ile Pro Glu Trp Tyr Glu

CCG CCA GCC AGC AAG GGG ATG AGC GGC
Pro Pro Ala Ser Lys Gly Met Ser Gly



GCG GGA ATA TCC GCC ATT TGG ATT
Ala Gly Ile Ser Ala Ile Trp Ile

GGT TAC TCG ATG GGC TAC GAT CCC
Gly Tyr Ser Met Gly Tyr Asp Pro



TAC GAT TTC TTT GAC CTC GGC GAG TAC AAC CAG AAG GGA ACC ATC GAA ACG
Tyr Asp Phe Phe Asp Leu Gly Glu Tyr Asn Gln Lys Gly Thr Ile Glu Thr



CGC TTT GGC TCT AAA CAG GAG CTC
Arg Phe Gly Ser Lys Gln Glu Leu

TAC GGC ATA AAG GTC ATA GCG GAC
Tyr Gly Ile Lys Val Ile Ala Asp

GAC CTC GAG TGG AAC CCG TTC GTT
Asp Leu Glu Trp Asn Pro Phe Val

AAG GTG GCC TCG GGC AAA TAT ACT 

Lys Val Ala Ser Gly Lys Tyr Thr 



ATC AAT ATG ATA AAC ACG GCC CAT GCC 
Ile Asn Met Ile Asn Thr Ala His Ala

ATC GTC ATA AAC CAC CGC GCA GGC GGA
Ile Val Ile Asn His Arg Ala Gly Gly

GGG GAC TAC ACC TGG ACG GAC TTC TCA

Gly Asp Tyr Thr Trp Thr Asp Phe Ser

GCC AAC TAC CTC GAC TTC CAC CCC AAC 
Ala Asn Tyr Leu Asp Phe His Pro Asn 



GAG GTC AAG TGC TGT GAC GAG GGC ACA TTT GGA GGC TTC CCA CAC ATA GCC 
Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly Gly Phe Pro Asp Ile Ala



CAC GAG AAG AGC TGG GAC CAG CAC
His Glu Lys Ser Trp Asp Gln His

GCC GCC TAC CTA AGG AGC ATC GGC
Ala Ala Tyr Leu Arg Ser Ile Gly

AAG GGC TAC GGA GCG TGG GTC GTC
Lys Gly Tyr Gly Ala Trp Val Val

TGG GCC GTT GGC GAG TAC TGG GAC
Trp Ala Val Gly Glu Tyr Trp Asp

GCC TAC TCG AGC GGC GCC AAG GTC
Ala Tyr Ser Ser Gly Ala Lys Val

GAT GAG GCC TTT GAC AAC AAA AAC
Asp Glu Ala Phe Asp Asn Lys Asn

AAC GGC CAG ACT GTT GTC TCC CGC
Asn Gly Gln Thr Val Val Ser Arg

GCA AAC CAC GAC ACC GAT ATA ATC 



TGG CTC TGG GCG AGC GAT GAG AGC TAC 
Trp Leu Trp Ala Ser Asp Glu Ser Tyr

GTT GAT GCC TGG CGC TTT GAC TAC GTG
Val Asp Ala Trp Arg Phe Asp Tyr Val

AAG GAC TGG CTC AAC TGG TGG GGC GGC
Lys Asp Trp Leu Asn Trp Trp Gly Gly

ACC AAC GTT GAT GCA CTC CTC AAC TGG
Thr Asn Val Asp Ala Leu Leu Asn Trp

TTC GAC TTC CCG CTC TAC TAC AAG ATG
Phe Asp Phe Pro Leu Tyr Tyr Lys Met

ATT CCA GCG CTC GTC TCT GCC CTT CAG
Ile Pro Ala Leu Val Ser Ala Leu Gln

GAC CCG TTC AAG GCC GTA ACC TTT GTA
Asp Pro Phe Lys Ala Val Thr Phe Val

TGG AAC AAG TAC CTT GCT TAT GCT TTC 



Ala Asn His Asp Thr Asp Ile Ile

ATC CTC ACC TAC GAA CCC CAG CCC
Ile Leu Thr Tyr Glu Gly Gln Pro

TGG CTC AAC AAG GAC AGG TTG AAC
Trp Leu Asn Lys Asp Arg Leu Asn

GCA GGT GGA AGC ACG AGC ATA GTT
Ala Gly Gly Ser Thr Ser Ile Val

GTG AGG AAC GGC TAT GGA AGC AAG
Val Arg Asn Gly Tyr Gly Ser Lys

GGC TCG AGC AAG GTT GGA AGG TGG
Gly Ser Ser Lys Val Gly Arg Trp

TGC ATC CAC GAG TAT ACT GGT AAC
Cys Ile His Glu Tyr Thr Gly Asn

TAC TCA AGC GGC TGG GTC TAT TTC
Tyr Ser Ser Gly Trp Val Tyr Phe



Trp Asn Lys Tyr Leu Ala Tyr Ala Phe
GTC ATA TTT TAC CGC GAC TAC GAG GAG

Val Ile Phe Tyr Arg Asp Tyr Glu Glu

AAC CTC ATA TGG ATA CAC GAC CAC CTC
Asn Leu Ile Trp Ile His Asp His Leu

TAC TAC GAC AGC GAC GAG ATG ATT TTC
Tyr Tyr Asp Ser Asp Glu Met Ile Phe

CCT GGC CTT ATA ACT TAC ATC AAC CTC

Pro Gly Leu Ile Thr Tyr Ile Asn Leu

GTT TAT GTG CCG AAG TTC GCG GGC GCG 
Val Tyr Val Pro Lys Phe Ala Gly Ala 

CTC GGA GGC TGG GTA GAC AAG TAC GTC 
Leu Gly Gly Trp Val Asp Lys Tyr Val 

GAA CCT CCA GCT TAC GAC CCT GCC AAC 
Glu Ala Pro Ala Tyr Asp Pro Ala Asn 



GGG CAG TAT GGC TAC TCC GTG TGG AGC TAT TGC GGT GTT GGG TGA 
Gly Gln Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly END



1S74

ATG ATA AAC GTT GCA ACG GGA GAG GAG ACC CCA ATA CAC CTC TTT GGA GTC
Met Ile Asn Val Ala Thr Gly Glu Glu Thr Pro Ile His Leu Phe Gly Val

AAC TGG TTC GGC TTT GAG ACA CCG AAC TAC GTT GTT CAC GGC CTA TGG AGT
Asn Trp Phe Gly Phe Glu Thr Pro Asn Tyr Val Val His Gly Leu Trp Ser

AGG AAC TGG GAG GAC ATG CTC CTC CAG ATC AAG AGC CTT GGC TTC AAT GCG
Arg Asn Trp Glu Asp Met Leu Leu Gln Ile Lys Ser Leu Gly Phe Asn Ala

ATA AGG CTT CCC TTC TGT ACC CAG TCA GTA AAA CCG GGG ACG ATG CCA ACG
Ile Arg Leu Pro Phe Cys Thr Gln Ser Val Lys Pro Gly Thr Met Pro Thr

GCG ATT GAC TAC GCC AAG AAC CCA GAC CTC CAG GGT CTT GAC AGC GTC CAG
Ala Ile Asp Tyr Ala Lys Asn Pro Asp Leu Gln Gly Leu Asp Ser Val Gln

ATA ATG GAG AAA ATA ATC AAG AAG GCT GGA GAC CTG GGC ATA TTC GTG CTC
Ile Met Glu Lys Ile Ile Lys Lys Ala Gly Asp Leu Gly Ile Phe Val Leu

CTC GAC TAC CAC AGA ATA GGA TGC AAC TTC ATA GAA CCC CTA TGG TAC ACC
Leu Asp Tyr His Arg Ile Gly Cys Asn Phe Ile Glu Pro Leu Trp Tyr Thr



GAC AGC TTC TCG GAG CAG GAC TAC

Asp Ser Phe Ser Glu Gln Asp Tyr

AGG TTC GGC AAG TAC TGG AAC GTT
Arg Phe Gly Lys Tyr Trp Asn Val



ATA AAC ACC TGG GTT GAA GTC GCC CAG
Ile Asn Thr Trp Val Glu Val Ala Gln

ATC GGC GCG GAC CTG AAG AAC GAA CCC
Ile Gly Ala Asp Leu Lys Asn Glu Pro



CAC AGC TCA AGC CCC GCA CCT GCC GCC TAC ACT GAC GGA AGT GGG GCC ACG
His Ser Ser Ser Pro Ala Pro Ala Ala Tyr Thr Asp Gly Ser Gly Ala Thr

TGG GGA ATG GGC AAC AAC GCC ACC GAC TGG AAC CTG GCG GCT GAG AGG ATA
Trp Gly Met Gly Asn Asn Ala Thr Asp Trp Asn Leu Ala Ala Glu Arg Ile



GGA AGG GCA ATT CTG GAG GTT GCC 

Gly Arg Ala Ile Leu Glu Val Ala

ACC CAG TTC ACC ACC CCC GAG ATA
Thr Gln Phe Thr Thr Pro Glu Ile

GCC TGG TGG GGC GGA AAC CTT ATG
Ala Trp Trp Gly Gly Asn Leu Met



CCA CAA TGG GTT ATA TTT GTT GAG GGA

Pro Gln Trp Val Ile Phe Val Glu Gly

GAC GGT AGG TAC AAG TGG GGC CAC AAC 
Asp Gly Arg Tyr Lys Trp Gly His Asn 

GGT GTT AGG AAG TAC CCA GTT AAC CTG 

Gly Val Arg Lys Tyr Pro Val Asn Leu



CCC AGG GAC AAG CTT GTT TAC AGC CCC CAA GTT TAC GGT CCA GAC GTT TAC 
Pro Arg Asp Lys Leu Val Tyr Ser Pro Gln Val Tyr Gly Pro Asp Val Tyr



GAC CAG CCC TAC TTT GAC CCC GGT 

Asp Gln Pro Tyr Phe Asp Pro Gly

ATA TGG TAC CAC CAC TTC GGC TAC 

Ile Trp Tyr His His Phe Gly Tyr

GTT ATA GGT GAG TTC GGA GGC AAG
Val Ile Gly Glu Phe Gly Gly Lys

GTC ACT TGG CAG AAC AAG ATA ATA
Val Thr Trp Gln Asn Lys Ile Ile

GAC TTC TTC TAC TGG AGC TGG AAC
Asp Phe Phe Tyr Trp Ser Trp Asn

CTG AAG GAT GAC TGG ACG ACA ATA
Leu Lys Asp Asp Trp Thr Thr Ile

AGG CTC ATG GAC AGC TGT TCT CCA 
Arg Leu Met Asp Ser Cys Ser Gly 



GAG GGG TTC CCC GAC AAC CTC CCC GAA 
Glu Gly Phe Pro Asp Asn Leu Pro Glu

GTA AAG CTT GAT CTC GGT TAC CCT GTT
Val Lys Leu Asp Leu Gly Tyr Pro Val

TAC GGC CAT GGG GGA GAC CCG AGG GAT
Tyr Gly His Gly Gly Asp Pro Arg Asp

GAC TGG ATG ATC CAG AAC AAA TTC TGT
Asp Trp Met Ile Gln Asn Lys Phe Cys

CCA AAC AGC GGT GAC ACC GGT GGA ATT

Pro Asn Ser Gly Asp Thr Gly Gly Ile

TGG GAG GAC AAG TAC AAC AAC CTG AAG
Trp Glu Asp Lys Tyr Asn Asn Leu Lys

AAC GCC ACT GCC CCG TCC GTC CCC ACG 
Asn Ala Thr Ala Pro Ser Val Pro Thr 



ACA ACT ACA ACA ACA AGC ACA CCG CCA ACG ACC ACA ACG ACT ACA ACA TCC 



Thr Thr Thr Thr Thr Ser Thr Pro Pro Thr Thr Thr Thr Thr Thr Thr Ser

ACT CCA ACG ACC ACT ACC CAG ACC CCG ACC ACC ACT ACT CCA ACT ACG ACA

Thr Pro Thr Thr Thr Thr Gln Thr Pro Thr Thr Thr Thr Pro Thr Thr Thr

ACC ACC ACG ACC ACA ACT CCT TCA AAT AAC GTC CCA TTT GAA ATT GTG AAC

Thr Thr Thr Thr Thr Thr Pro Ser Asn Asn Val Pro Phe Glu Ile Val Asn

GTT CTC CCG ACT AGC TCC CAG TAC GAG GGA ACC AGC GTG GAG GTT GTA TGT

Val Leu Pro Thr Ser Ser Gln Tyr Glu Gly Thr Ser Val Glu Val Val Cys

GAT GGA ACC CAG TGT GCC TCC AGC GTT TGG GGA GCT CCG AAC CTC TGG GGA

Asp Gly Thr Gln Cys Ala Ser Ser Val Trp Gly Ala Pro Asn Leu Trp Gly

GTG GTT AAA ATC GGA AAC GCC ACC ATG GAC CCC AAC GTT TGG GGC TGG GAG

Val Val Lys Ile Gly Asn Ala Thr Met Asp Pro Asn Val Trp Gly Trp Glu

GAC GTT TAC AAG ACT GCA CCC CAG GAC ATT GGA ACC GGC AGC ACA AAG ATG

Asp Val Tyr Lys Thr Ala Pro Gln Asp Ile Gly Thr Gly Ser Thr Lys Met

GAG ATA AGG AAC GGG GTG CTC AAG GTT ACA AAC CTC TGG AAC ATC AAC ATG

Glu Ile Arg Asn Gly Val Leu Lys Val Thr Asn Leu Trp Asn Ile Asn Met



AEPIIla (?n-y#63GP4) Ji/^-f 

L 

GCT GGA GTG GGT CAG CAA CGG GAT AAC CTA CCA GAT ATT CCC CGA CAu GTT
Ala Gly Val Gly Glu Gln Arg Asp Asn Leu Pro Asp Ile Pro Arg Gln Val

CAA CAA CGG AAA CAG GAG CAA CGA TGC CCT AGC TTT GGA CCA CGA CGA GCT
Gln Gln Arg Lys Gln Glu Gln Arg Cys Pro Ser Phe Gly Pro Arg Arg Ala

AAT TCT GAA CCA GGT CAA TCC AGG CAA ACC AAT CCT CTC CAA CTG GAG CGA
Asn Ser Glu Pro Gly Gln Ser Arg Gln Thr Asn Pro Leu Gln Leu Glu Arg

CCC TAT AAC GCC CCT CCA CTG CTG CCA CCA CTA CTT CGG CGG CGA CAT AAA
Pro Tyr Asn Ala Pro Pro Leu Leu Pro Pro Val Leu Arg Arg Arg His Lys

GGG AAT AAC GGA GAA GCT CGA CTA CCT TCA GAG CCT AGG TGT TAC TAT AAT
Gly Asn Asn Gly Glu Ala Arg Leu Pro Ser Glu Pro Arg Cys Tyr Tyr Asn

CTA CCT CAA CCC GAT TTT CCT CTC GGG AAG CGC CCA CGG CTA CGA CAC CTA
Leu Pro Gln Pro Asp Phe Pro Leu Gly Lys Arg Pro Arg Leu Arg His Leu

CGA CTA CTA CCG GCT TGA CCC CAA GTT CGG GAC CGA GGA GGA GCT GAG AGA
Arg Leu Leu Pro Ala End Pro Gln Val Arg Asp Arg Gly Gly Ala Glu Arg



CAT CCG AAG TAT AAC ACA ATG CCA TAC CCG GAG GTC ATA TAC GCC GCC AAG 
His Pro Lys Tyr Asn Thr Met Ala Tyr Pro Glu Val Ile Tyr Gly Ala Lys

CCT TGG GGC AAC CAG CCA ATA AAC GCT CCG AAC TTC GTG CTC CCG ATA AAG
Pro Trp Gly Asn Gln Pro Ile Asn Ala Pro Asn Phe Val Leu Pro Ile Lys

GTC TCC CAG CTT CCG AGG ATA CTC GTT GAC ACA AAG TAC ACG CTC GAA AAG
Val Ser Gln Leu Pro Arg Ile Leu Val Asp Thr Lys Tyr Thr Leu Glu Lys

AGC TTC CCG GGA AAC AAC TTC GCC TTT GAG GCC TGG CTC TTC AAG GAT GCC
Ser Phe Pro Gly Asn Asn Phe Ala Phe Glu Ala Trp Leu Phe Lys Asp Ala

AAC AAC ATG AGG GCA CCA GGC CAG GGG GAC TAC GAG AGG AAT TCC GCC GAT
Asn Asn Met Arg Ala Pro Gly Gln Gly Asp Tyr Glu Arg Asn Ser Ala Asp

ACT GAC GGG CTC CAG GAG TCG TCG CCA CCA ATC CCC ATA TGG AAA CCG TCG
Thr Asp Gly Leu Gln Glu Ser Ser Pro Pro Ile Pro Ile Trp Lys Pro Ser

ATA AGC TTG CGG CCG CCA CCG CGG TGG AGC TCC AGC TTT TGT TCC CTT TAA
Ile Ser Leu Arg Pro Pro Pro Arg Trp Ser Ser Ser Phe Cys Ser Leu End



GTT CCT CGA TGA GGC ACA CAG GCG GGG AAT GAG GGT AAT TTT CCA TTT TGT
Val Pro Arg End Gly Thr Gln Ala Gly Asn Glu Gly Asn Phe Arg Phe Cys

GCC CAA CCA CTG CGG CAT AGG GAA TCC AGC CTT CCT AGA AGT TTG GAA GAA
Ala Gln Pro Leu Arg His Arg Glu Ser Ser Leu Pro Arg Ser Leu Glu Glu

GGG CAA CGA AAG CCC ATA CTG GGA CTG GTT CTT CGT CAA GAA GTG GCC GTT
Gly Gln Arg Lys Pro Ile Leu Gly Leu Val Leu Arg Gln Glu Val Ala Val

CAA GCT CGG CGA TGG GAA CGC CTA CGT CGG CTG GTG GGG CTT TGG GAG CCT
Gln Ala Arg Arg Trp Glu Arg Leu Arg Arg Leu Val Gly Leu Trp Glu Pro

TCC AAA GCT CAA CAC TGC CAA CCC GGA GGT CAG GGA ATA CCT GAT AGG AGC
Ser Lys Ala Gln His Cys Gln Pro Gly Gly Gln Gly Ile Pro Asp Arg Ser

GGC CCT CCA CTG GAT AGA GTT CGG CTT TGA CGG CAT CAG GGT TGA TGT GCC
Gly Pro Pro Leu Asp Arg Val Arg Leu End Arg His Gln Gly End Cys Ala

GAA CGA AGT CCT CGA CCC GGG AAC GTT CTT CCC GGA GCT GAG AAA GGC AGT
Glu Arg Ser Pro Arg Pro Gly Asn Val Leu Pro Gly Ala Glu Lys Gly Ser

CAA GGA GAA AAA GCC GGA CGC ATA CCT CGT CGG TGA GAT ATG GAC GCT CTC
Gln Gly Glu Lys Ala Gly Arg Ile Pro Arg Arg End Asp Met Asp Ala Leu



CCC TGA GTG GGT GAA AGG AGA CCG CTT CGA CTC CCT CAT GAA CTA CGC CCT 
Pro End Val Gly Glu Arg Arg Pro Leu Arg Leu Pro His Glu Leu Arg Pro

CGG GAG GGA CAT CCT CCT GAA CTA CGC GAA GGG CCT GCT CAG TGG AGA AAG
Arg Glu Gly His Pro Pro Glu Leu Arg Glu Gly Pro Ala Gln Trp Arg Lys

TGC AAT GAA AAT GAT GGG ACG TTA CTA TGC TTC CTA CGG CGA GAA CGT ATT
Cys Asn Glu Asn Asp Gly Thr Leu Leu Cys Phe Leu Arg Arg Glu Arg Ile

GCG ATG GGC TTC AAC CTC GTT GAT TCG CAC GAC ACT TCG AGG GTT CTC ACT
Ala Met Gly Phe Asn Leu Val Asp Ser His Asp Thr Ser Arg Val Leu Thr

GAT CTC GGT GGG GGG AGT CTC GGT GAC ACA CCG TCA AAC GAG TCA ATT CAG
Asp Leu Gly Gly Gly Ser Leu Gly Asp Thr Pro Ser Asn Glu Ser Ile Gln

AGA CTC AAG CTC CTC TCA ACG TCC TCT ATG CCC TGC CTG GAA CTC CGG TCA
Arg Leu Lys Leu Leu Ser Thr Ser Ser Met Pro Cys Leu Glu Leu Arg Ser

CCT TCC AGG GGA TGA GAG AGG ACT GCT CGG AGA CAA GGG GCA CTA CGA CGA
Pro Ser Arg Gly End Glu Arg Thr Ala Arg Arg Gln Gly Ala Leu Arg Arg

ACA GCG CTA CCC AAT ACA GTG GGA TAC TGT GAA CGA AGA CGT CCT GAA CCA 



Thr Ala Leu Pro Asn Thr Val Gly 

TTA CAG GGC ATT GGC GGA GCT CAG 

Leu Gln Gly Ile Gly Gly Ala Gln

CGC AAT AAG GTT CTA CAC TGC CAA
Arg Asn Lys Val Leu His Cys Gln

GCA TCA TGA CGA GGT TCT TGT CGT
Ala Ser End Arg Gly Ser Cys Arg

ACT AAA GCT TCC TGA GGG AGA GTS
Thr Lys Ala Ser End Gly Arg Val

CCC GGA ACT GCT TCG CCG CAA AGT
Pro Gly Thr Ala Ser Arg Gln Ser



Tyr Cys Glu Arg Arg Arg Pro Glu Pro 

AAA AAG AGT TCC TGC ATT GAG GAG CAG 
Lys Lys Ser Ser Cys Ile Glu Glu Gln

AGG CGG CGT TAT GGC CTT CTT CAG GGG
Arg Arg Arg Tyr Gly Leu Leu Gln Gly

TGC CAA CAG CTG GAA GAA GCC AGC CCT
Cys Gln Gln Leu Glu Glu Ala Ser Pro

GAA AGT AAT CTG GCC TGA GAA TTT CAG

Glu Ser Asn Leu Ala End Glu Phe Gln

TGA AGT GCC AGC CAT AGG GAT AAT CAT
End Ser Ala Ser His Arg Asp Asn His



CCT TGA GCG GAG TTG
Pro End Ala Glu Leu

ATG ACT CAA TTA TAT ATA AAA AAT CCC CTC ATC GAA CAC CCG GCA CAT CCC

Met Thr Glu Leu Tyr Ile Lys Asn Pro Leu Ile Glu Gln Arg Ala Asp Pro

TGG ATC TAT AAA CAT ACC GAT GGT TAT TAT TAC TTT ACC GGT ~ZZ GTG CCG
Trp Ile Tyr Lys His Thr Asp Gly Tyr Tyr Tyr Phe Thr Gly Ser Val Pro

GAG TAC GAC CGA ATT GAG CTT AGA CGC TCG CAA ACC ATT CAA GGG CTT CCG
Glu Tyr Asp Arg Ile Glu Leu Arg Arg Ser Gln Thr Ile Gln Gly Leu Ala



GAT GCC GAA GGA ATT ACG ATC TGG
Asp Ala Glu Gly Ile Thr His Trp

GCC AAC ATA TGG GCA CCC GAG ATT
Ala Asn Ile Trp Ala Pro Glu Ile



CGC AAG CAT GAG TCA GGC CTG ATG AGT

Arg Lys His Glu Ser Gly Leu Met Ser

CAT TAT ATG GAT GGC AAA TGG TAT GTG

His Tyr Met Asp Gly Lys Trp Tyr Val



TAT TAC GCC GCT GCC CAT ACT TCA
Tyr Tyr Ala Ala Ala His Thr Ser

CGC ATG TTC GTA TTG GAG AAC GCT

Arg Met Phe Val Leu Glu Asn Ala



GAA ACG AGG GAC GGA TTC TTC GAT CAC
Glu Thr Arg Asp Gly Leu Phe Asp His

TCG GCG AAC CCG CTC GAA GGG GAA TGG

Ser Ala Asn Pro Leu Glu Gly Glu Trp



GTG GAG AAG GGG CAA GTG ATC ACG AAG TGG GAA TCT TTC GCC TTG GAC GCA 
Val Glu Lys Gly Gln Val Ile Thr Lys Trp Glu Ser Phe Ala Leu Asp Ala

ACG ACG TTC GAG CAT AAA GGC AAA CGG TAC TAT GTA TGG GCT CAG AAA GAT

Thr Thr Phe Glu His Lys Gly Lys Arg Tyr Tyr Val Trp Ala Gln Lys Asp

CCG GGC ATT CCA GGC AAT TCC AAT CTG TAT ATC TCA TTG ATG GAA GAC CCG
Pro Gly Ile Pro Gly Asn Ser Asn Leu Tyr Ile Ser Leu Met Glu Asp Pro

TGG ACC CTG ACA GGG GAA CAG GTA TGC ATA TCG GTT CCC GAG TAC GAT TGG
Trp Thr Leu Thr Gly Glu Gln Val Cys Ile Ser Val Pro Glu Tyr Asp Trp

GAG AAG ATC GGG TAT CTT GTG AAT GAA GGG GCC GCC GTT CTT AAG CGA AAC
Glu Lys Ile Gly Tyr Leu Val Asn Glu Gly Ala Ala Val Leu Lys Arg Asn

GGG CGA ATA TTC ATG ACC TAT TCC GCG AGC GCC ACG GAC CAC AAC TAT GCG
Gly Arg Ile Phe Met Thr Tyr Ser Ala Ser Ala Thr Asp His Asn Tyr Ala

ATG GGG CTG CTG ACA GCC GAT GAA GAC AGT GAT TTG CTG AAT CCG AGC TCC
Met Gly Leu Leu Thr Ala Asp Glu Asp Ser Asp Leu Leu Asn Pro Ser Ser

TGG GTC AAG TCG CCT GTA CCT GTA TTT ACG ACA TCT GAA GCC AAT GGC CAA
Trp Val Lys Ser Pro Val Pro Val Phe Thr Thr Ser Glu Ala Asn Gly Glu



TAT GGT CCG GGG CAC AAC AGC TTC
Tyr Gly Pro Gly His Asn Ser Phe

ATT TTG GTA TAC CAT GCA AGA AGT
Ile Leu Val Tyr His Ala Arg Ser

ATG ATC CGA ACC GTC ATA CGC GTG
Met Ile Arg Thr Val Ile Arg Val



ACG ATT TCC GAG GAC GGC TTG CAG GAC
Thr Ile Ser Glu Asp Gly Leu Gln Asp

TAC AAG GAG ATC GTC GGG ATC CAC TAT
Tyr Lys Glu Ile Val Gly Ile His Tyr

TAC AGG TCA TCC GAT GGA ACG AAG ACG
Tyr Arg Ser Ser Asp Gly Thr Lys Thr



GAA CGC CGA ATT TCG GGG TGC CAA GAG CGG ATC ATG AAC CGG TCT CCA AGC

Glu Arg Arg Ile Ser Gly Cys Gln Glu Arg Ile Met Asn Arg Ser Pro Ser

CAT GAT GCC GAC TTT GTC ATT GGG GTT GTG ACC GGA AGG ATT AAC AAA CAT
His Asp Ala Asp Phe Val Ile Gly Val Val Thr Gly Arg Ile Asn Lys His



CAG ACC GAC TGA 1031 
Gln Thr Asp END

TTC AAT AAC ACC ATT CCA AGA TGG CGT GGT TTC AAC CTT CTG GAG GCC TTT

Leu Asn Asn Thr Ile Pro Arg Trp Arg Gly Phe Asn Leu Leu Glu Ala Phe

TCC ATT AAA ACT ACA GGA AAT TTT AAA GAG GAA GAT TTT TTG TGG ATG GCT
Ser Ile Lys Ser Thr Gly Asn Phe Lys Glu Glu Asp Phe Leu Trp Met Ala

CAG TGG GAC TTT AAT TTT GTT AGA ATC CCT ATG TGT CAT CTT CTC TGG TCA
Gln Trp Asp Phe Asn Phe Val Arg Ile Pro Met Cys His Leu Leu Trp Ser

GAC CGG GGC AAC CCA TTT ATT ATC AGA GAA GAT TTT TTT GAG AAA ATC GAT
Asp Arg Gly Asn Pro Phe Ile Ile Arg Glu Asp Phe Phe Glu Lys Ile Asp

CGT GTA ATT TTC TGG GGA GAG AAA TAT GGA ATA CAT ATA TGT ATT TCT CTT
Arg Val Ile Phe Trp Gly Glu Lys Tyr Gly Ile His Ile Cys Ile Ser Leu

CAC AGG GCA CCT GGC TAT TCT GTT AAC AAG GAA GTA GAA GAG AAA ACC AAT
His Arg Ala Pro Gly Tyr Ser Val Asn Lys Glu Val Glu Glu Lys Thr Asn

CTG TGG AAA GAT GAA ACA GCT CAA GAA GCG TTC ATT CAT CAC TGG TCT TTT
Leu Trp Lys Asp Glu Thr Ala Gln Glu Ala Phe Ile His His Trp Ser Phe



ATC GCA CGT CGT TAC AAA CGA ATT TCT TCC ACA CAC CTG AGT TTT AAC TTA 
Ile Ala Arg Arg Tyr Lys Gly Ile Ser Ser Thr His Leu Ser Phe Asn Leu

ATA AAT GAG CCT CCA TTT CCT GAT CCA CAA ATC ATG AGT GTT GAA GAT CAC

Ile Asn Glu Pro Pro Phe Pro Asp Pro Gln Ile Met Ser Val Glu Asp His

AAC TCT CTT ATC AAG AGA ACT ATT ACA GAA ATT CGA AAA ATA GAT CCT GAA
Asn Ser Leu Ile Lys Arg Thr Ile Thr Glu Ile Arg Lys Ile Asp Pro Glu

AGA TTA ATT ATA ATA GAT GGA TTA GGC TAT GGG AAT ATT CCA GTG GAT GAT
Arg Leu Ile Ile Ile Asp Gly Leu Gly Tyr Gly Asn Ile Pro Val Asp Asp

TTA ACA ATT GAG AAT ACA GTG CAA TCA TGC AGA GGG TAC ATT CCC TTC AGT
Leu Thr Ile Glu Asn Thr Val Gln Ser Cys Arg Gly Tyr Ile Pro Phe Ser

GTT ACT CAT TAC AAA GCG GAA TGG GTG GAT AGT AAG GAC TTT CCT GTT CCT
Val Thr His Tyr Lys Ala Glu Trp Val Asp Ser Lys Asp Phe Pro Val Pro

GAG TGG CCA AAT GGA TGG CAT TTT GGG GAA TAC TGG AAC AGA GAA AAG TTA
Glu Trp Pro Asn Gly Trp His Phe Gly Glu Tyr Trp Asn Arg Glu Lys Leu

TTG GAA CAT TAT TTA ACG TGG ATA AAA CTC AGA CAA AAA GGA ATA GAA GTA

Leu Glu His Tyr Leu Thr Trp Ile Lys Leu Arg Gln Lys Gly Ile Glu Val



TTC TGT GGA GAA ATG GGA GCT TAC 
Phe Cys Gly Glu Met Gly Ala Tyr

AAA TGG CTT GAA GAT CTT TTA GAA

Lys Trp Leu Glu Asp Leu Leu Glu

GCC TTA TGG AAT TTT AGA GGT CCT
Ala Leu Trp Asn Phe Arg Gly Pro

GAC GTT GAA TAC GAA GAA TGG TAT
Asp Val Glu Tyr Glu Glu Trp Tyr

GAA CTA TTG AGA AAA TAT TAG
Glu Leu Leu Arg Lys Tyr End



AAC AAA ACA CCT CAC GAT GTC GTT TTA 
Asn Lys Thr Pro His Asp Val Val Leu

ATT TTT AAA ACT TTG AAC ATA GGG TTT

Ile Phe Lys Thr Leu Asn Ile Gly Phe

TTT GGT ATT TTA GAT TCG GAA AGG AAA
Phe Gly Ile Leu Asp Ser Glu Arg Lys

GGA CAT AAA CTG GAT AGG AAA ATG TTG
Gly His Lys Leu Asp Arg Lys Met Leu

ATG CTC TCA GAG ATT GTT CCG TAT ACT GTT CTG AGA AGA GAA AGA ATA GAA
Met Leu Ser Glu Ile Val Pro Tyr Thr Val Leu Arg Arg Glu Arg Ile Glu

AGC TGG ATT TTC TCC GAT GAT GCT GTT GAG AGA ATC GTG GAT CCT TCC TTC

Ser Trp Ile Phe Ser Asp Asp Ala Val Glu Arg Ile Val Asp Pro Ser Phe

GAA TGG GAC TTC AGC TCC GCT CCC GTC CGG TTC AGG AAA GAG CTA GAG CCT
Glu Trp Asp Phe Ser Ser Ala Pro Val Arg Phe Arg Lys Glu Leu Glu Pro

TTC TCC GTC GCT GGA GAG CAG AGG GCC TAC CTG AAA CTC TGG TTC GGT GGT
Phe Ser Val Ala Gly Glu Gln Arg Ala Tyr Leu Lys Leu Trp Phe Gly Gly

GAA ACA CTC GTT CTG ATA GAT GGG AAG CCT TAC GGT GAG ATC AAC GAG TAT
Glu Thr Leu Val Leu Ile Asp Gly Lys Pro Tyr Gly Glu Ile Asn Glu Tyr

CAT AGG ATG TTG AAC ATC ACC CCC CTT GCT GAT GGA AAA CCA CAC ACG ATA
His Arg Met Leu Asn Ile Thr Pro Leu Ala Asp Gly Lys Pro His Thr Ile

GAA GCT CAG GTG ATG CCA AGG GGT CTC TTT GGA AAA CCA GAA AAG CCG GTG
Glu Ala Gln Val Met Pro Arg Gly Leu Phe Gly Lys Pro Glu Lys Pro Val



TTC ACG GAA GCT TTC TTC ATC GTC 
Phe Thr Glu Ala Phe Phe Ile Val

AAA ACT CTC GAA CTC ACT ATA AAA
Lys Thr Leu Glu Leu Thr Ile Lys

CTT TCT AAG AAA CTT CTG GAC ATC
Leu Ser Lys Lys Leu Leu Asp Ile

ATC CCA AGA GAC ACA GGT ACC TAT
Ile Pro Arg Asp Thr Gly Thr Tyr

ATA AAA GAT GAG ATC AAA AAC ACC
Ile Lys Asp Glu Ile Lys Asn Thr

ACA GGT GTG AAG
Thr Gly Val Lys

GAA AAA TTC AAA GAA AAG CTG GAT
Glu Lys Phe Lys Glu Lys Leu Asp

GGA ACG ATT CAC CTT GTG GGG CAC
Gly Thr Ile His Leu Val Gly His



GTT GAT GAA GCA CTG ATG AAG GTG GTG
Val Asp Glu Ala Leu Met Lys Val Val

ACG GCA GAA GTG ATA GAA GAC GAG TCG
Thr Ala Glu Val Ile Glu Asp Glu Ser

TCC GAG GAG TTT CTC TCG AAA GTA TGG
Ser Glu Glu Phe Leu Ser Lys Val Trp

CTG ATG ACA GCA CTG GAG GAT CCG GGA

Leu Met Thr Ala Leu Glu Asp Pro Gly

TGG AAC ACA CCG GAG TTC AAA GAG TTC
Trp Asn Thr Pro Glu Phe Lys Glu Phe



AGA ATA AGA AAA AAC CAT CCG GGT TTT
Arg Ile Arg Lys Asn His Pro Gly Phe

GCG CAC ATA GAC TAC GCC TGG CTC TGG
Ala His Ile Asp Tyr Ala Trp Leu Trp



CTT CCT GAA GAG TTG AGA AAT CAG ATT CTG GAA GAG TTC 
Leu Pro Glu Glu Leu Arg Asn Gln Ile Leu Glu Glu Phe



CCA CTT GAG GAG ACG AAG AGA AAG ATC CTA CGC ACT TTC GCA AAC TCT GTG
Pro Val Glu Glu Thr Lys Arg Lys Ile Leu Arg Thr Phe Ala Asn Ser Val

TTG CTC TCT AAG CTT TAT CCG GAG TTC GTT TAC ACT GAG TCT TCT GCT CAG
Leu Leu Ser Lys Leu Tyr Pro Glu Phe Val Tyr Thr Gln Ser Ser Ala Gln

ATG TAC GAG GAT CTC AAG CAA AAT TCA CCA GAG CTT TTC GAG GAA GTG AGA
Met Tyr Glu Asp Leu Lys Gln Asn Ser Pro Glu Leu Phe Glu Glu Val Arg

AAG CTC GTA GAA GAG GGG AGA TGG GAG CCA GTC GGT GGC ATG TGG GTG GAG
Lys Leu Val Glu Glu Gly Arg Trp Glu Pro Val Gly Gly Met Trp Val Glu

TCG GAC TGC AAC GTT CCA TCG ATA GAG TCG CTT GTG AGA CAG TTC TAC TAT
Ser Asp Cys Asn Val Pro Ser Ile Glu Ser Leu Val Arg Gln Phe Tyr Tyr

GGG CAA AAA TTC TTC GAA AGA GAA TTC GGG AAA AAG AGC AAG GTG TGC TGG
Gly Gln Lys Phe Phe Glu Arg Glu Phe Gly Lys Lys Ser Lys Val Cys Trp

CTT CCG GAT GTG TTT GGG TTT TCC TGG GTG CTT CCC CAA ATT CTG AAA GAA
Leu Pro Asp Val Phe Gly Phe Ser Trp Val Leu Pro Gln Ile Leu Lys Glu

GCC GGG ATA AAA TAC TTC GTC ACC ACG AAA CTC AAC TGG AAC GAC ACG AAC



Ala Gly Ile Lys Tyr Phe Val Thr Thr Lys Leu Asn Trp Asn Asp Thr Asn



GAG TTT CCG TAC GAT CTG TGC CGC TGG AGG GGA ATA GAT GGA TCC GAA GTG
Glu Phe Pro Tyr Asp Leu Cys Arg Trp Arg Gly Ile Asp Gly Ser Glu Val

ATC TAT TTC AGT TTC AAA AAT CCC AAC GAG GGG TAC AAC GGA AAG ATA GAT
Ile Tyr Phe Ser Phe Lys Asn Pro Asn Glu Gly Tyr Asn Gly Lys Ile Asp

CCC GAT ACG GTG TAC AAA ACC TGG AAG AAC TTC AGG CAG AAA GAT CTC ACA
Pro Asp Thr Val Tyr Lys Thr Trp Lys Asn Phe Arg Gln Lys Asp Leu Thr

AAC AGA GTT CTT CTT TCG TTC GGA CAC GGT GAT GGT GGT GGC GGT CCA ACC
Asn Arg Val Leu Leu Ser Phe Gly His Gly Asp Gly Gly Gly Gly Pro Thr

GAA GAG ATG CTG GAA AAT TAC GAG GTT CTG AAG GAT TTC CCT GGA CTA CCG

Glu Glu Met Leu Glu Asn Tyr Glu Val Leu Lys Asp Phe Pro Gly Leu Pro

CAC CTT GAA ATG GGA ACT GTG GAA GAA TTT TTC AAG AAG GTG GAG ATC GAC
His Leu Glu Met Gly Thr Val Glu Glu Phe Phe Lys Lys Val Glu Ile Asp

GAA GAA CTC CCT GTG TGG GAC GGA GAG CTT TAC CTT GAA CTT CAC AGG GGA 
Glu Glu Leu Pro Val Trp Asp Gly Glu Leu Tyr Leu Glu Leu His Arg Gly 



ACC TAC ACT TCT CAC TTC AGG ACA AAG AAA CTT CAC AAA GAA GCG GAA GAC
Thr Tyr Thr Ser Gln Phe Arg Thr Lys Lys Leu His Lys Glu Ala Glu Asp

AGT CTT TAT CTT GCA GAG TTG ATC TCG GCT TTC ACG GAT AAA GAT TTT TCG

Ser Leu Tyr Leu Ala Glu Leu Ile Ser Ala Phe Thr Asp Lys Asp Phe Ser

GAC GAA ATA GAC GAA CTC TGG AAG ATT CTG TTG AGA AAC GAA TTT CAC GAT
Asp Glu Ile Asp Glu Leu Trp Lys Ile Leu Leu Arg Asn Glu Phe His Asp

ATT CTA CCT GGA TCT TCT ATA AAG GAA GTC TAT GAA GAT ACA GAA AAA GAG
Ile Leu Pro Gly Ser Ser Ile Lys Glu Val Tyr Glu Asp Thr Glu Lys Glu

CTC AGA CAT GTG ATA GAA AAA TCA AAA GAC ATC GTT ATC GAA TCT CTC AAA
Leu Arg His Val Ile Glu Lys Ser Lys Asp Ile Val Ile Glu Ser Leu Lys

GTT CTT TCC TCT GAG AAC AAA GAT GTT CTA ACC ATT TTG AAC GCT TCA TCG

Val Leu Ser Ser Glu Asn Lys Asp Val Leu Thr Ile Leu Asn Ala Ser Ser

TTT CCA AAG AAG TGT CTT TTC TTC CTC AAC GAA GAT CTC GCG ATT TCC TTT
Phe Pro Lys Lys Cys Leu Phe Phe Leu Asn Glu Asp Leu Ala Ile Ser Phe

GAA GGA GAA GCA CTC TTG AAA CAG AAA ACT CAC GAT GGA AGG TAT GTG TAC
Glu Gly Glu Ala Leu Leu Lys Gln Lys Thr His Asp Gly Arg Tyr Val Tyr



TTC ATA GAC AGG GAG ATT CCT CCG TTC ACG AAA GTA GAA CTG AAA GTT CGC 
Phe Ile Asp Arg Glu Ile Pro Pro Phe Thr Lys Val Glu Leu Lys Val Arg

AAA GCC ACG TCT GAG GAA ACT CCA ACT GAG TTG AGA GAA ACA AAC ATC ATG

Lys Ala Thr Ser Glu Glu Thr Pro Ser Glu Leu Arg Glu Thr Asn Ile Met

GAG AAC GAA TTT CTC AGG GTG CAC GTC AAC GAT GAC GGA ACA ATT CAA ATC
Glu Asn Glu Phe Leu Arg Val His Val Asn Asp Asp Gly Thr Ile Gln Ile

TAC GAC AAA GAA CTG GAC AGG TAC GTT TTC GAA GAG AAG GGA AAC ATC TTG
Tyr Asp Lys Glu Leu Asp Arg Tyr Val Phe Glu Glu Lys Gly Asn Ile Leu

AAA CTT CAT AAA AAC ATC CCT GCT TAC TGG GAC AAC TGG GAT ATC GCA GAA
Lys Leu His Lys Asn Ile Pro Ala Tyr Trp Asp Asn Trp Asp Ile Ala Glu

AAC GTG GAA AAG ACA GGA TAT ACC CTG AGG GCC AAA AAC ATA GAA AAA ATA
Asn Val Glu Lys Thr Gly Tyr Thr Leu Arg Ala Lys Asn Ile Glu Lys Ile

GAG TCT GGC CCT GTT CGA GAA GTG ATC CGT GTT GAA CAT GAA TCA GAA GGA
Glu Ser Gly Pro Val Arg Glu Val Ile Arg Val Glu His Glu Ser Glu Gly

AGC AGG ATC ACG CAG CAT TAC ATC CTT TAC AGA AAG AGT AGA AGG CTC GAT 



Ser Arg Ile Thr Gln His Tyr Ile Leu Tyr Arg Lys Ser Arg Arg Leu Asp

ATA GAA ACG AAG GTA GAC TGG CAC ACA AGG CGT GCG CTT CTC AGA GCC TAC
Ile Glu Thr Lys Val Asp Trp His Thr Arg Arg Ala Leu Leu Arg Ala Tyr

TTC CCA ACA ACT GTT CTG TCG AGA AAG GCT AGG TTC GAT ATC TCC GGT GGT
Phe Pro Thr Thr Val Leu Ser Arg Lys Ala Arg Phe Asp Ile Ser Gly Gly

TTC ATC GAA AGG CCC ACA CAC AGA AAC ACC AGT TTC GAA CAG GCG CGT TTC
Phe Ile Glu Arg Pro Thr His Arg Asn Thr Ser Phe Glu Gln Ala Arg Phe

GAG GTG CCG TTT CAC AGG TGG ATG GAT CTT TCC CAG ACA GAC TTC GGC GTG
Glu Val Pro Phe His Arg Trp Met Asp Leu Ser Gln Thr Asp Phe Gly Val

TCC ATT CTG AAC GAC GGA AAA TAC GGT GGC AGT GTT CAT CAG GGT ATC ATG

Ser Ile Leu Asn Asp Gly Lys Tyr Gly Gly Ser Val His Gln Gly Ile Met

GCG CTT TCA CTG ATA AAA GCG GGT ATT TTC CCC GAT TTT CTC TGT GAC GAA
Ala Leu Ser Leu Ile Lys Ala Gly Ile Phe Pro Asp Phe Leu Cys Asp Glu

GGC GAA CAC ACT TTC ACC TAT TCT GTC TAC GTA CAC CCT GGA GAC AGC TTG

Gly Glu His Thr Phe Thr Tyr Ser Val Tyr Val His Pro Gly Asp Ser Leu



AGA GAT GTT GTA AAA GGA TCA GAA 
Arg Asp Val Val Lys Gly Ser Glu

CGC GGG GTG TTG AAC CTC CCC TCT
Arg Gly Val Leu Asn Leu Pro Ser

TTC CGT CTC ACC TCA CTG AGA AGG
Phe Arg Leu Thr Ser Leu Arg Arg

GTT GAG ATT TTC GGA ACA TCA GGG
Val Glu Ile Phe Gly Thr Ser Gly

GGT GAA ATC TAT CAG ACG AAC GTT
Gly Glu Ile Tyr Gln Thr Asn Val

TTC CCA GTG GTT TAC CAT CCG TTC
Phe Pro Val Val Tyr His Pro Phe



GAT CTC AAC AGA TCT TTC ATC GTT CAT 
Asp Leu Asn Arg Ser Phe Ile Val His

CCT TTA CTG GAG ATC TCT CCT CAA AAC
Pro Leu Leu Glu Ile Ser Pro Gln Asn

GTG AAG GAC AAA ATT GTT TTG AGG CTT
Val Lys Asp Lys Ile Val Leu Arg Leu

AAA CTT TCC ATT AAA CTC CCA TGG CAT
Lys Leu Ser Ile Lys Leu Pro Trp His

CTG GAA GAG AAA AAA CAG AAA GTC ACC
Leu Glu Glu Lys Lys Gln Lys Val Thr

AAG ATC TAC ACT TTT GTT GTA GAA GGT
Lys Ile Tyr Thr Phe Val Val Glu Gly



TGA
END

ATG CAA CTG TAC AGG GAT CCT TCG CAA CCC ATC GAA GTG AGA GTG AGA GAT
Met Glu Leu Tyr Arg Asp Pro Ser Gln Pro Ile Glu Val Arg Val Arg Asp

CTT CTT TCC AGA ATG ACG CTG GAA GAG AAA GTG CCC CAG CTT GGG TCT GTC
Leu Leu Ser Arg Met Thr Leu Glu Glu Lys Val Ala Gln Leu Gly Ser Val

TGG GGT TAC GAA CTG ATA GAC GAG AGG GGA AAG TTC AGT AGA GAA AAA GCA 
Trp Gly Tyr Glu Leu Ile Asp Glu Arg Gly Lys Phe Ser Arg Glu Lys Ala

AAA GAA CTC CTC AAA AAT GGT ATA GGC CAG ATC ACA AGG CCT GGT GGA TCA
Lys Glu Leu Leu Lys Asn Gly Ile Gly Gln Ile Thr Arg Pro Gly Gly Ser

ACG AAC CTT GAA CCT CAA GAA GCC GCG GAA CTT GTG AAC GAA ATA CAG AGA
Thr Asn Leu Glu Pro Gln Glu Ala Ala Glu Leu Val Asn Glu Ile Gln Arg

TTT CTT GTG GAA GAA ACA CGC CTT GGA ATT CCT GCG ATG ATA CAC GAA GAA

Phe Leu Val Glu Glu Thr Arg Leu Gly Ile Pro Ala Met Ile His Glu Glu

TCT CTC ACC GGT TAC ATG GGA CTT GGA GGA ACC AAC TTC CCT CAG GCG ATA
Cys Leu Thr Gly Tyr Met Gly Leu Gly Gly Thr Asn Phe Pro Gln Ala Ile



GCA ATG GCC AGT ACA TGG GAT CCA GAT CTC ATA GAA AAA ATG ACC ACC GCC 
Ala Met Ala Ser Thr Trp Asp Pro Asp Leu Ile Glu Lys Met Thr Thr Ala

GTC AGA GAG GAT ATG AGA AAG ATA GGG GCA CAT CAG GGT CTC GCA CCT GTT

Val Arg Glu Asp Met Arg Lys Ile Gly Ala His Gln Gly Leu Ala Pro Val

CTC GAT GTC GCA AGA GAT CCA AGG TGG GGG AGA ACA GAA GAG ACG TTC GGA

Leu Asp Val Ala Arg Asp Pro Arg Trp Gly Arg Thr Glu Glu Thr Phe Gly

GAA TCT CCC TAT CTG GTG GCG AGG ATG GGA GTC TCT TAC GTG AAA GGC CTC
Glu Ser Pro Tyr Leu Val Ala Arg Met Gly Val Ser Tyr Val Lys Gly Leu

CAG GGG GAA GAT ATC AAA AAA GGT GTC GTT GCC ACA GTG AAA CAC TTC GCC
Gln Gly Glu Asp Ile Lys Lys Gly Val Val Ala Thr Val Lys His Phe Ala

GGA TAC AGC GCT TCT GAA GGT GGA AAG AAC TGG GCA CCA ACG AAC ATT CCG
Gly Tyr Ser Ala Ser Glu Gly Gly Lys Asn Trp Ala Pro Thr Asn Ile Pro

GAG AGG GAA TTC AAA GAG GTC TTT CTC TTT CCG TTC GAA GCG GCC GTT AAA

Glu Arg Glu Phe Lys Glu Val Phe Leu Phe Pro Phe Glu Ala Ala Val Lys

GAA GCG AAT GTG CTT TCT GTG ATG AAC TCC TAC AGC GAA ATA GAC GGT GTC
Glu Ala Asn Val Leu Ser Val Met Asn Ser Tyr Ser Glu Ile Asp Gly Val



CCA TGT CCA GCG AAC AGG AAA CTC CTC ACA GAC ATT CTC AGA AAA GAC TGG 

Pro Cys Ala Ala Asn Arg Lys Leu Leu Thr Asp Ile Leu Arg Lys Asp Trp

GGA TTC GAA GGA ATC GTC GTT TCT GAC TAT TTT GCT GTG AAA GTT CTG GAA

Gly Phe Glu Gly Ile Val Val Ser Asp Tyr Phe Ala Val Lys Val Leu Glu

GAT TAT CAC AGA ATA GCA AGG GAT AAG TCA GAA GCC GCA AGA CTC GCA CTT

Asp Tyr His Arg Ile Ala Arg Asp Lys Ser Glu Ala Ala Arg Leu Ala Leu

GAA GCG GGG ATA GAT GTT GAA CTT CCG AAG ACA GAA TGT TAT CAA TAT TTG

Glu Ala Gly Ile Asp Val Glu Leu Pro Lys Thr Glu Cys Tyr Gln Tyr Leu

AAA GAC CTT GTT GAA AAA GGC ATC ATC TCC GAA GCT TTG ATC GAC GAG GCA
Lys Asp Leu Val Glu Lys Gly Ile Ile Ser Glu Ala Leu Ile Asp Glu Ala

GTC ACC AGG GTG CTG AGG CTG AAG TTC ATG CTC GGG CTC TTC GAA AAT CCC
Val Thr Arg Val Leu Arg Leu Lys Phe Met Leu Gly Leu Phe Glu Asn Pro

TAC GTT GAG GTG GAA AAA GCA AAG ATA GAA AGT CAC AGA GAC ATC GCA CTC
Tyr Val Glu Val Glu Lys Ala Lys Ile Glu Ser His Arg Asp Ile Ala Leu

GAG ATA GCA AGG AAA TCC ATT ATC CTT CTC AAG AAT GAT GGA ATT CTG CCT



Glu Ile Ala Arg Lys Ser Ile Ile

CTT CAG AAA AAC AAA AAA C7T GCC 
Leu Gin Lys Asa Lys Lys Val Ala 

AGA AAT CTC CTC GGA GAT TAC ATG 
Arg Asn Leu Leu Gly Asp 7yr Mec 

GAC AAC ATA GAC GAC GTC TTT GGA 
Asp Asn He Asp Asp val Phe Gly 



Leu Leu Lys Asn Asp Gly Ha Leu Pra 

CTG ATC GGA CCG AAC GCG GOT GAG GTC 
Leu He Gly Pro Asn Ala Gly Glu Vai 

TAC CTT GCA CAC ATA AGG GCT CTC CTC 
Tyr Leu Ala His He Arg Ala Lau Leu 

AAT CCT CAG ATC CCG AGA GAA AAC TAC 

Asn Pra Girt He Pro Arg Glu Asn Tyr 



GAA AGA CTG AAG AAG AGC ATA GAA GAA CAT ATG AAG AGC ATT CCG AGT GTT 
Glu Arg Leu Lys Lys ser He Glu Glu His Met Lys S*r lie Pro Sar Val 

CTC GAT GCC TTC AAA GAA GAA GGG ATC GAA TTC GAA TAT GCA AAA GGC TGT 

Leu Asp Ala Phe Lys Glu Glu Gly He Giu Phe Glu Tyr Ala Lys Gly Cys 



GAA GTG ACA GGG GAA GAC AGA AGC GGT TTC GAA GAG GCG ATA GAA ATT GCA 

Glu Val Thr Gly Glu Asp Arg Ser Gly Phe Glu Glu Ala He Glu He Ala 



AAG AAA TCC GAC GTT GCC ATC GTT GTC GTA GGG GAC AAA TCT GGA CTC ACC 
Lys Lys Ser Asp Val Ala He Val Val Val Gly Asp Lys Ser Gly Leu Tr.r 



CTT GAC TGC ACA ACC GGT GAG TCC AGA GAC ATG GCA AAC CTC AAG CTT CCA 
Leu Asp Cys Thr Thr Gly Glu Ser Arg Asp Met Ala Asn Leu Lys Leu Pro

GGA GTC CAG GAA GAA CTC GTC CTC GAA GTT GCA AAG ACA GGA AAA CCC GTC 
Gly Val Gin Glu Glu Leu Val Leu Glu Val Ala Lys Thr Gly Lys Pro Val 

GTT CTT GTC CTC ATC ACG GGA AGA CCC TAT TCA CTC AAA AAC GTC GTjfcrGAC 
Val Leu Val Lau lis Thr Gly Arg Pro Tyr Ser Leu Lys Asn Val Val As? 

AAG GTG AAC GCG ATC CTT CAG GTG TGG CTT CCT GGA GAA GCG GGA GGA AGA 
Lys Val Asn Ala lie Leu Girt Val Trp Leu Pro Gly Glu Ala Gly Gly Arg 

GCG ATC GTT GAC ATC ATC TAT GGA AAG GTG AAT CCC TCT GGA AAA CTC CC3 
Ala He Val Asp He lie Tyr Gly Lys Val Asn Pro Sar Gly Lys Lau Pro 

ATC AGC TTT CCA AGA AGC GCT GGT CAG ATT CCT GTC TTC CAC TAC GTC AAA 
He Ser Phe fro Arg Sar Ala Gly Gin He Pro Val Prv* Kis Tyr Val Lys 

CCA TCC GGG GGA AGG TCT CAC TGG CAC GGA GAC TAC GTG GAT GAG AGC ACA 
Pro Ser Gly Gly Arg Ser Kis Trp His Gly Asp Tyr Val Asp GLu Ssr Thr 

AAG CCT CTC TTC CCG TTT GGG CAC GGT TTG TCT TAC ACG AAG TTC GAG TAC 
Lys Pro Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Thr Lys Phe Glu Tyr 



AGC AAC CTC AGA ATC GAG COG AAG GAA GTG CCA CCS GCC GGC GAA GTG GTG 
Sar Asn Leu Arg lie Glu Pro Lys Glu Val Pro Pro Ala Gly Glu val Val 

ATA AAG GTG SAC GTG GAA AAC ATC GGA GAC AGA GAC GGA GAC GAG GTG GTT 

lie Lys Val Asp Val Glu Asr. Ha Gly Asp Arg Asp Gly Asp Glu val val 

CAA CTT TAC ATC GGT CGT GAG TTT CCA AGC GTC ACA AGG CCT GTG AAA GAG 
Gin Leu Tyr lie Gly Arg Glu Phe Ala Ser Val Thr Arc Pro Val Lys Glu 

CTG AAG GGC TTC AAG AGG GTT TCT TTG AAG GCG AAA GAG AAG AAG ACT GTT 
Leu Lys Gly Pha Lys Arg Val Ser Lau Ly9 Ala Lys Glu Lys Lys Thr Val 

GTG TTC AGG CTT CAC ATG GAC GTG CTC GCC TAC TAC AAC AGA GAC ATQ AAA 
Val Phe Arg Lau Eis Mat Asp Val Lau Ala Tyr Tyr Asn Arg Asp Met Lys 

CTC GTG GTT GAA CCC GGT GAG TTC AAA GTG ATG GTG GGA AGC TCT TCT GAA 

Leu Val Val Glu Pro Gly Glu Phe Lys Val Mec Val Gly Sar Ser Sar Glu 

GAC ATC AGA CTC ACA GGT TCT TTC TCC GTC GTC GGT GAA AAA AGA GAA GTG 
Asp lis Arg Leu Thr Gly Ser Phe Ser Val Val Gly Glu Lys Arg Glu Val 

GTG GGA ATG AGG AAA TTC TTC ACG GAA GCC TGC GAG GAG TGA 2336 
Val Gly Met Arg Lys Phe Phe Thr Glu Ala Cys Glu Glu END



1 

ATG GGG ATT GGT GGC GAC GAC TCC TGG AGC CCC TCA CTA TCG GCG GAA TTC 
MeC Gly lie Gly Gly A« A« Sar T r? Ser Pro Ser V*l S ar Ala Glu Pha 



CTT TTA TTG ATC GTT QSG CTC TCT TTC GTT C7C TTT CCA AGT GAC GAG TTC 
Lau Lau Leu n* V.l Glu Lau S« Ph 2 Val Leu PAe Ala Ser Asp Glu 9h* 

GTG AAA GTG GAR AAC GGA AAA TTC GCT CTG AAC GGA AAA GAA TTC AGA TTC 
val Lys VaX Glu Asn Gly Lys Phe Ala Leu Asn Gly Lys Glu Pha Arc Ph* 

ATT GGA AGC AAC AAC TAC TAC ATG CAC TAG AAG AGC AAC GGA ATG ATA GAC 
He Gly Ser Asa Asa Tyr Tyr Mec His Tyr Lys Sar Aan Gly Mas Sit Asp 

AGT GTT CTG GAG AGT CCC AGA GAC ATG GGT ATA AAG GTC CTC AGA ATC TGG 
Sar Val Leu Glu Sar Ala Arc Asp Mac Gly Ila Lys Val lau Arc lis T~ 

GGT TTC CTC GAC GGG GAG AGT TAC TGC AGA GAC AAG AAC ACC TAC ATG CAT 
Gly eha Lau Asp Gly Glu sar Tyr Cys Arg Asp Lys Asn Thr Tyr Mec His 

CCT GAG CCC GGT GTT TTC GGG GTG CCA GAA GGA ATA TCG AAC GCC CAG AGC 
Pro Glu Pro Gly Val Phe Gly Val Pro Glu Gly Ila Ser Asn Ala Gin sar 



GGT TTC GAA AGA CTC GAC TAC ACA 
Gly Phe Glu Arg Leu Asp Tyr Thr 

AAA CTT GTC ATT GTT CTT GTG AAC 

Lys Leu val lie Val Leu Val Asn 

CAG TAC GTG AGG TGG TTT GGA GGA 
Gin Tyr Val Arg Trp Phe Gly Gly 

GAG AAG ATC AAA GAA GAG TAC AAA 
Glu Lys lie Lys Glu Glu Tyr Lys 

GTC AAT ACC TAC ACG GGA GTT CCT 

Val Asn. Thr Tyr Thr Gly Val Pro 

TGG GAG CTT GCA AAC GAA CCG CGC 

Trp Glu Leu Ala Asa Glu Pro Arg 

CTC CTT GAG TGG GTG AAG GAG ATG 
Leu Val Glu Trp Val Lys Glu Meu 

AAC CAC CTC GTG GCT GTG GGG GAC 
Asn His Leu Val Ala Val Gly Asp 



GTT GCG AAA GCG AAA GAA CTC GGT ATA 
Val Ala Lys Ala Lys Glu Leu Giy lie 

AAC TGG GAC GAC TTC GGT GGA ATG AAC 
Asn Trp Asp Asp Phe Gly Gly Mec Asr. 

ACC CAT CAC GAC GAT TTC TAC AGA GAT 
Thr His His Asp Asa Phe Tyr Arg Asp 

AAG TAC GTC TCC TTT CTC GTA AAC CAT 
Lys Tyr Val Ser Phe Leu Val Asn Kis 

TAC AGG GAA GAG CCC ACC ATC ATG GCC 

Tyr Arg Glu. Glu Pr= Thr He Mec Ala 

TGT GAG ACG GAC AAA TCG GGG AAC ACG 
Cys Glu Thr Asp Lya Ser Gly Asn Thr 

AGC TCC TAC ATA AAG AGT CTG GAT CCC 
Ser Ser Tyr He Lys ser Leu Asp Pro 

GAA GGA TTC TTC AGC AAC TAC GAA GGA 

Glu Gly Phe Phe Ser Asn Tyr Glu Gly 



TTC AAA CCT TAC GGT GGA GAA GCC 
Phe Lya Pro Tyr Gly Gly Glu Ala 

GTT GAC TOG AAG AAG CTC CTT TCG 

Val Asp Trp Lys Lys Leu Laii'sar 

CAC CTC TAT CCG TCC CAC TGG GOT 
His Leu Tyr Pro Ser His Trp Gly 

GGA GCG AAG TGG ATA GAA GAC CAC 
Gly Ala Lys Trp lis Glu Asp Els 



GAG TCC CCC TAC AAC GCC TGG TCC GGT 
Glu Trp Ala Tyr Asn Gly Trp ser Gly 

ATA GAG ACS GTG GAC TTC GGC ACG TTC 

lie Glu Thr val Asp Pr.a Gly Thr Pha 

GTC AGT CCA GAG AAC TAT GCC CAG TGG 
Val Ser Pro Glu Asn Tyr Ala Gin Trp 

ATA AAG ATC GCA AAA GAG ATC GGA AAA 
He Lys Ha Ala Lys Glu lis Gly Lys 



CCC GTT GTT CTG GAA GAA TAT GGA 
Pro. Val Val Leu Glu Glu Tyr Gly 

ACG GCC ATC TAC AGA CTC TGG AAC 

The Ala tie Tyr Arg Leu Trp Asn 



ATT CCA AAG AGT GCG CCA GTT AAC AGA 

He Pro Lys Ser Ala Pro Va.1 Asn Arg 

GAT CTG GTC TAC GAT CTC GGT GGA GAT 

Asp Leu Val Tyr Asp Leu Gly Gly Asp 



GGA GCG ATG TTC TGG ATG CTC GCG GGA ATC GGG GAA GGT TCG GAC AGA GAC 
Gly Ala Met Pne Trp Mec Leu Ala Gly lie Gly Glu Gly Ser Asp Arg Asp 

GAG AGA GGG TAC TAT CCG GAC TAC GAC GGT TTC AGA ATA GTG AAC GAC GAC 



GLu Arg Gly Tyr Tyr Pro Asp Tyr 

AGT CCA GAA GCG GAA CTG ATA AGA 
Ser Pro Glu Ala Giu Leu lie Arg 

GAA GAC ATA AGA GAA GAC ACC TGC 

Glu Asp Ha Arg C-lu Asp Thr- Cys 

GAG ATC AAA AAG ACC GTG GAA GTG 
Glu lis Lys Lys Thr val Glu val 

ACQ TTT GAA AAG T?G TCT GTC AAA 
Thr Phe Glu Lys Leu Ser Val Lys 

ATA GAG CAT CTC GGA TAC GGA ATT 

He Glu Kis Lau Gl/ Tyr Gly Ha 

ATC CCG GAT GGA GAA CAT GAA ATG 
He Pro Asp Gly Glu His Glu Mac 



Asp Gly Phe Arg He Val asr Asp Asp 

GAA TAC GCG AAG CTG TTC AAC ACA GGT 
Glu Tyr Ala Lys Leu Phe Asn Thr Gly 

TCT TTC ATC CTT CCA AAA GAC GC-C ATG 
Ser Pie He Leu Pro Lys Asp Gly Met 

AGG GCT GGT GTT TTC C-AC TAC AGC AAC 
Arg Ala Gly val Phe Asp Tyr Sar Asr. 

GTC C-AA GAT CTG GTT TTT GAA AAT GAG 
Val Glu Asp Lau Val Phe Glu Asn Glu 

TAC G3C TTT GAT CTC GAC ACA ACC CG3 

Tyr Gly Phe Asp Leu Asp Thr Thr Arg 

TTC CTT GAA GGC CAC TTT CAG GGA AAA 
Phe Lau Glu Gly His Phe Gin Gly Lys 



ACG GTG AAA GAC TCT ATC AAA GCG AAA GTG GTG AAC GAA GCA CSG TAC GTG 

Thr Val Lys Asp Ser He Lys Ala Lys Val Val Asn Glu Ala Arg Tyr Val 



CTC GCA GAG GAA GTT GAT TTT TCC TCT CCA GAA GAG GTG AAA AAC TGG TGG 
Leu Ala Glu Glu Val Asp Phs Ser Ser Pro Glu Glu Val Lys Asr. Trp Trp 

AAC AGC GGA ACC TGG CAG GCA GAG TTC GGG TCA CCT GAC ATT GAA TGG AAC 

Asn Ser Gly Ttr Trp Gin Ala Glu Pf.e Gly Ser Pro Asp lie Glu Trp Asr, 

GGT GAG GTG GGA AAT GGA GCA CTG CAG CTG AAC GTG AAA CTG CCC GGA AAG 
Gly Glu Val Gly Asn Gly Ala Leu Gin Leu Asr. Val Lys Leu Pro Gly Lys 

AGC GAC TGG GAA GAA GTG AGA GTA GCA AGG AAG TTC GAA AGA CTC TCA GAA 
Ser Asp Trp Glu Glu Val Arg Val Ala Arg Lys Phe Glu Arg Leu Ser Glu 

TGT GAG ATC CTC GAG TAC GAC ATC TAC ATT CCA AAC GTC GAG GGA CTC AAG 
Cys Glu lie Lau Glu Tyr Asp He Tyr He Pro Asa Val Glu Gly Leu Lys 

GGA AGG TTG AGG CCG TAC GCG GTT CTG AAC CCC GGG TGG GTG AAG ATA GGC 
Gly Arg Leu Arg Pro Tyr Ala Val Leu Asn Pro Gly Trp Val Lys lie Gly 

CTC GAC ATG AAC AAC GCG AAC GTG GAA AGT GCG GAG ATC ATC ACT TTC GGC 
Leu Asp Met Asr. Asn Ala Asn Val Glu Ssr Ala Glu He He Tar Phe Gly 

GGA AAA GAG TAC AGA AGA TTC CAT GTA AGA ATT GAG TTC GAC AGA ACA GCG 
Gly Lys Glu Tyr Arg Arg Phe His Val Arg He Glu Phe Asp Arg Thr Ala 



COG GTG AAA GAA CTT CAC ATA GGA 
Gly Val Lys Glu Leu His ILe Gly 

GGA CCG ATT TTC ATC GAT AAT GTG 

Gly Pro Ila Phe lie Asp Asn Vil 



GTT GTC GGT GAT CAT CTG AGG TAC GAT 
Val Val Gly Asp His Leu Arg Tyr Asp 

AGA CTT TAT AAA AGA ACA GGA GGT ATG 

Arg Leu Tyr Lys Arg Thr Gly Gly Mat 



TGA 2042 
END 
ATC TTC CTG CAT CCG AGO GGT CSC ATG ACC CCC CTA GCG CTC GGC TGT GCC
Met Phe Leu His Pro Arg Gly Arg Met Thr Arg Leu Ala Leu Gly Cys Ala

CTG TGT CTG GCC GTC GCA GGC TGC GGT GGT GGT GAT GAC GAC GGC GAC 
Leu Cys Leu Ala val Ala Gly cys Gly Gly Gly As? Asp Asp Gly Asp 

AAC GGC ACC GCC CCC CAG CCC GCA CCT GGT CAA CCC GAG CCC CCG ACT 

Asn Gly Thr Ala Pro Gla Pro Ala Pro Gly Gin Pro Glu Pro Pro Thr 

GAC ACC GTG CTG AAA GAC TGG CCT CGC ATC AAC AGC AGC ATC 
Asp Thr Val Leu Lys Asp Trp Pro Arg lie Asr. Ser Ser He 

GCA GCG ATC GAA AGC CGC GTC AAC TCA CTC GTC GCG GCG ATG ACG CTG GAA 
Ala Ala He Glu Ser Arg Vai Asr. Ser Lau val Ala Ala Ms: Thr Lau Glu 

GAA AAA GTC GGC CAG ATG ACG CAG GTC GAA ATC CAG GAG GTG ACG CCG GAG 
Glu Lys Val Gly Gla Mec Thr Gla Val Glu He Gin Glu Val Thr Pro C-L'j. 

GAG ATC CCG CAG TAC CAC ATC GGC TCC GTG CTC AAC GGC GGT GGT TCG TTC 
Glu He Arg Gin Tyr His He Gly Ser Val Leu Asa Gly Gly Gly Sar ??.e 



GTG 

Val 

GAC 
Asp 



ACC GCC GAC 
Thr Ala Asp 



CCG AAG CAG GAC AAG GGC GCG GCG GTG ACC GAC TGG CTG GCG GTG GCC GAC 
Pro Lys Gin Asp Ly3 Gly Ala Ala Val Tlxr Asp Trp Leu Ala Val Ala Asp 

GCC TTG TGG GCC GCG TCG ATG GAT CCC GCC AAG CCG CGG CGC ATC CCG CTC 

Ala Leu Trp Ala Ala Sar Mac Asp Pro Ala Lys Pro Arg Arg ILa Pre Leu 

ATC TGG GGC ACC GAC GCC GTC CAC GGC CAC AAC AAC GTC AAG GGC GCG ACC 
lie Trp Gly Thr Asp Ala Val His Gly Kis Asn Asn Val Lys Gly Ala Thr 

ATC TTC CCG CAC AAC ATC GGC CTG GGC GCC GCG CGC GAC CCC GAC TTG GTC 
He Phe Pro Kis Asn lie Gly Leu Gly Ala Ala Arc Asp Pro Asp Leu Val 

GCC CGC ATC GGC GCC GCC ACG GCG CTG GAA GTG GCA CGC ACC GGC ATC GAC 
Ala Arg lie Gly Ala Ala Thr Ala Leu C-lu Val Ala Arg Thr Gly lie Asp 

TGG GTG TTC GCG CCA ACG CTG GCG GTC GTG CGC GAC GAC CGC TGG GGC CGC 
Trp Val Phe Ala Pro Thr Leu Ala Val Val Arg Asp Asp Arg Trp Gly Arg 

AGC TAC GAA GGC TAT TCG GAA GAC CCC GAA ATC GTC GTC TCC TAT GCC GGC 
Ser Tyr Giu Gly Tyr Ser Glu Asp Pro Glu He Val Val Ser Tyr Ala Gly 

AAG ATG GTC GAA GGC CTG CAG GGC CGA TTG GCG CAG GAC GCG AAG GCC AAC 
Lys Met Val Giu Gly Leu Gin Gly Arg Leu Ala Gin Asp Ala Lys Ala Asn 



GAG AAC GTG GTG GCC fl.CC GCC AAG CAT TTC GTC GGC GAC GGC CGC ACC GAC 
Glu Cys Val Val Ala Thr Ala Lys Kis Phe Val Gly Asp Giy Gly Thr Asp 

CAG GGC AAG GAC CAG GGG GTC ACC CGG GTC ACC GAG CGC GAC CTG TTG AAC 
Gin Gly Lys Asp Gin Gly val Thr Arg val Thr Glu Arg Asp Leu Leu Asn 

GTC CAT GCG CGC GGC TAC ATC CCC GCG CTC GAG GCG GGC GCG CAA ACC GTG 

Val His Ala Arg Gly Tyr lie Pro Ala Lau Glu Ala Gly Ala Gin Thr val 

ATG GCC TCC TTC AAC AGC TGG CAG GAC CCG TCG CAG GGC GAG GGC GCC AAG 
Met Ala Ser She Asn Sar Tr? Gin As? Pro Ser Gla Gly Glu Gly Ala Lys 

GCC TTC AAG ATG CAT GGC AGC CGC TAC CTG CTC ACC GAG GCC CTC AAG CAG 
Ala Phss Lys Mac Sis Gly Sar Arg Tyr Lau Lau Thr Glu Ala Leu Lys Gin 

AAG ATG GGC TTC GAC GOT TTC GTG GTG TCC GAC TGG AAC GGC ATC GGC CAG 
Lys Met Gly Phe Asp Gly Pha val Val Sar Asp Trp Asn Gly lie Gly Gin 

GTC ACC ACC GAG AAC AGC AAC GCG ACG CGC AAC TGC AGC AAC AGC GAC TGC 
Val Thr Thr Glu Asn Sar Asn Ala Thr Arg Asa Cys Ser Asn Ser Asp Cys 

CCC GAG GCC ATC AAC GCT GGC ATC GAC ATG GTG ATG GTG CCG TAC CGG GCC 



Pro Glu Ala lie Asn Ala Gly Ila Asp Mat Val Mec Val Pro Tyr Arg Ala 

GAC "TGG AAG GCC TTC ATC ACC AAC ACA ATT GCA ATT GTC CGC AAA GGC GAG 
Asp Trp Lya Ala Phe He Thr Asn Thr lie Ala He Val Arg Lys Giy Glu 

ATC GCG CAG GAG CGC ATC GAC AAC GCG GTG CGG CGC ATC CTG CGC GTC AAG 
Ha Ala Gin Glu Arg lie Asp Asn Ala Val Arg Arg Ha Leu Arc Val Lys 

TTG CGC GCC GGT CTG TTC GAC AAG CCC ACA CCC TCC GCC CGT CTG GCC TCG 
Leu Arg Ala Gly Leu Phe Asp Lys Pro Thr Pro Ser Ala Arg Leu Ala Sar 

CGC GAG GTC GGC AGC GCC GAA CAC CGG GCG CTC GCG CGT GAA GCG GTG. CGC 
Arg Glu Val Gly Ser Ala Glu His Arg Ala Leu Ala Arg Glu Ala Val Arg 

AAG TCG TTG GTG CTG TTG AAG AAC AAC GGC CBG GTG CTG CCG CTG GCA CGC 

Lys ssr Lau Val Leu Leu Lys Asn Asn Gly Arg Val Leu ?ro Leu Ala Arg 

AAT GCC AAG GTC CTG GTG GCC GGC AAG AGC GCC AAC AGC CTC GAG AAC CAG 
Asn Ala Lys Val Leu Val Ala Gly Lys Ser Ala Asn Ser Leu Glu Asn Gin 

ACC GGC GGC TGG TCG CTC AGC TGG CAA GGC ACC GGC AAC GCC AAC GCC GAT 
Thr Gly Gly Trp Ser Leu Ser Trp Gin Gly Thr Gly Asn Ala Asn Ala Aap 



TTC GGC GGC GGC ACG ACC GTG TGG CAG GCG ATC CAG AAG ATC GCC CCG AAT 
Phe Gly Gly Gly Thr Thr Val Trp Gin Ala lie Gin Lys lie Ala Pro Asn 

GCC GAA CTC GAC ACC AGC GCC GAC GGC GCC AAG GGC AGC GAT GCC TAC GAC 
Ala Glu Leu Asp Thr Ser Ala Asp Gly Ala Lys Gly Ser Asp Ala Tyr Asp 

GCC GCG ATC GTC GTG ATC GGT GAA ACA CCG TAC GCC GAA GGT GTC GGA GAC 
Ala Ala He Val Val He Gly Glu Thr Pro Tyr Ala Glu Gly Val Gly Asp 

ATC GGC CGC AGC AAG ACG CTG GAA CTC ACC AAG CTG CGT CCA GAA GAC CTC 
He Gly Arg Ser Lys Thr Leu Glu Leu Thr Lys Leu Arg Pro Glu Asp Leu 

GCC GTG ATC GAA GGC CTG CGC GCC AAG GGC GTG AAG AAA ATC GTC ACG CTG 
Ala Val He Glu Gly Leu Arg Ala Lys Gly Val Lys Lys He Val Thr Leu 

CTG GTC TCC GGC CGC CCG CTC TAC GTC AAC AAG GAG CTG AAC CGC TCG GAC 
Leu Val Sar Gly Arg Pro Leu Tyr Val Asn Lys Glu Leu Asn Arg Ser Asp 

GCC TTC GTG GCG GCG TGG CTG CCC GGC ACC GAA GGC GAC GGC GTC GCC GAC 
Ala Phe Val Ala Ala Trp Leu Pro Gly Thr Glu Gly Asp Gly Val Ala Asp 

GTG CTG TTC CGT GCG GCC GAC GGC AGC GTC CCG CAT GGC TTC AGC GGC AAG 
Val Leu Phe Arg Ala Ala Asp Gly Ser Val Ala His Gly Phe Ser Gly Lys 



CTG TC<3 TTC TCG TGG CCG AAG TCG GCC TGC CAG ACG CCG CTC AAC CGT GGC 
Lau Sar Phe Sar Trp Pro Lys Ser Ala Cys Gin Thr Pro Leu Asn Ar? Giy 

GAC GCC ACC TAC GAC CCG CTC TAC GCT TAT GGC TAC GGC CTT CAA TAC GGC 

Asp Ala Thr Tyr Asp Pro Leu Tyr Ala Tyr Gly Tyr Gly Leu Gin Tyr Gly 

GAG GAG ACC GAT CAG AGC GC3 TAC GAC GAA AGC AGT SCC ACG C-TC GGC TC-C 

Glu Glu Thr Asp Gin Ser Ala Tyr As? Glu Ser Ser Ala Thr Val Gly Cys 

GGC ATC CAG GAC GGC GGC GGC ACC ACG GCC GAG CCG CTG GCG GTG TTC GAA 
Gly lis Gin Asp Gly Gly Gly Thr Thr Ala Glu Pro Lau Ala Val She Glu 

GGC GGA GCC AAC CAG GGC AAC TGG AAG CTG CGC ATC GGC GCC GAG TCG AGC 
Gly Gly Ala Asn Gin Gly Asn Trp Lys Lau Arg Ila Gly Ala Glu. Ser Sar 

TGG AGC AAC GAT GTG ACG CTG GCC AGC AGC GCG GTG ACG TCG ACG CCG TGC 
Trp Sar Asn Asp Val Thr Leu Ala Ser Ser Ala Val Thr Ser Thr Pro Sar 

AAC GAA CTG CAG GCC GTG CCG GTG GAC GAC AAG GCC GGG CGG CAA TGG GCG 
Asn Glu Leu Gin Ala Val Pro Val Asp Asp Lys Ala Gly Arg Gin Trp Ala 

GCG GTG AAG GCG ACC TGG AAC GAC AAG CCC CGC CAG CTC TAC ATG CAA AGC 



Ala Val Ly3 Ala The Trp Asn Asp Lys Pro Gly Gin Leu Tyr Met Gin Ser 

GCC AAC CCC CCC GAC CTG GTG GAG CTG ATG GCC TAT CAG AAC TCC GGT GGC 
Ala Asn Pro Gly Asp Leu Val Asp Leu Met Ala Tyr Gin Asn Ser Gly Gly 

GCG CTG GTG TTC GAG CTG CGT GTC GTC AGT GCG CCG ACC GAC CCG GTC AAG 
Ala Leu Val Phe Asp Leu Arg Val Val Ser Ala Pro Thr Asp Pro Val Lys 

CTG CGC GTC GAT TGC GGC TGG CCC TGT CTG GGC GAG ATC GAC GTC ACC AGC 
Leu Arg val Asp Cys Gly Trp Pro cys Leu Gly Glu lie Asp Val Thr Ser 

GCC GTC AAG GCC CAG CCG GTC AAC GCC TGG AAG GAA GTG GCG GTG TCG CTG 
Ala Val Lys Ala Gin Pro Val Asn Ala Trp Lys Glu Val Ala Val Ser Leu 

CAG TGT TTC GCC GAC GCC GGC ACC GAC CTG GCC ATC GTC AAC ACG CCC TTC 
Gin Cys Phe Ala Asp Ala Gly Thr Asp Leu Ala He Val Asn Thr Pro Phe 

CTG ATG TAC ACG TCT GGC CGC TTC GAA GCT GCC GTC GCG AAC ATC CGT TGG 
Leu Met Tyr Thr Ser Gly Arg Phe Glu Ala Ala Val Ala Aan He Arg Trp 

GAG CCC AAG CGC ACG CCC AAC GTS GGG TGC AAC GGC GCA CCG ATC GCC GCC 
Glu Pro Lys Arg Thr Pro Asn Val Gly Cys Asn Gly Ala Pro He Ala Ala 

GCG CCT TGA 2711 
Ala Pro END 
ATG AGC AAG AAA AAG TTC CTC ATC GTA TCT ATC TTA ACA ATC CTT TTA GTA
Met Ser Lys Lys Lys Phe Val Ile Val Ser Ile Leu Thr Ile Leu Leu Val

CAG GCA ATA TAT TTT GTA GAA AAG TAT CAT ACC TCT GAG GAC AAG TCA ACT 
Sin Ala II* Tyr Pha Val Glu Lys Tyr His Thr Ser GLu Asp Lys Sar Thr 

TCA AAT ACC TCA TCT ACA CCA CCC CAA ACA ACA CTT TCC ACT ACC AAG GTT 

Ser Asa Thr Ser Sar Thr Pro Pro Gla Thr Thr Leu Ser Thr Thr Lys Val 

CTC AAG ATT AGA TAC CCT GAT GAC GGT GAG TGG CCA GGA GCT CC7 ATT GAT 
Lau Lyt Ila Arg Tyr Pro Asp Asp Gly Glu Trp Pro Gly Ala Pre Ila Asa 

AAG GAT GGT GAT GGG AAC CCA GAA TTC TAC ATT GAA ATA AAC CTA TGG AAC 
Lys Asp Gly As? Gly Ash Pro Glu Phe Tyr lie Glu lie Asn Leu Tr? Asa. 

ATT CTT AAT GCT ACT GGA TTT GCT GAG ATG ACG TAG AAT TTA ACC AGC GGC 
Ila Leu Asn Ala Thr Gly Phe Ala Glu Met Thr Tyr Asn Leu Thr Ser Gly 

CTC CTT CAC TAC GTC CAA CAA CTT GAC AAC ATT GTC TTC AGG GAT AGA AGT 
Val Leu His Tyr Val Gin Glh Leu Asp Asn He Val Leu Arg Asp Arg Ser 



AAT TGG GTG CAT GGA TAC CCC GAA ATA TTC TAT GGA AAC AAG CCA TGG AAT 
Asn Trp Val His GLy Tyr Pro Glu lis Phe Tyr Gly Asn Lys Pro Trp Asn 

GCA AAC TAG GCA ACT GAT GGC CCA ATA CCA TTA CCC AGT AAA GTT TCA AAC 

Ala Asn Tyr Ala Thr Asp Gly Pro lie Pro Leu Pra Sar Lys Val Sar Asn 

CTA ACA GAC TTC TAT CTA ACA ATC TCC TAT AAA CTT GAG CCC AA3 AAC GC-C 
Leu Thr Asp Phe Tyr Leu Thr tie Ssr Tyr Lys Levi Glu Pro Lys Asn Gly 

CTG CCA ATT AAC TTC GCA ATA GAA TCC TGG TTA ACG AGA GAA GCT TGG AGA 
Leu Pro He Asn Phe Ala lie Glu Ser Trp Leu Thr Arg Glu Ala Trp Arc 

ACA ACA GGA ATT AAC AGC GAT GAG CAA GAA GTA ATG ATA TGG ATT TAC TAT 
Thr Thr Gly lie Asr. Ser Asp Glu Gin Glu Val Met lis Trp He Tyr Tyr 

GAC GGA TTA CAA CCG GCT GGC TCC AAA GTT AAG GAG ATT GTA GTC CCA ATA 
Asp Gly Leu Gla Pro Ala Gly Ser Lys Val Lys Glu Ha Val Val Pro lis 

ATA GTT AAC GGA ACA CCA GTA AAT GCT ACA TTT GAA GTA TGG AAG GCA AAC 

lie Val Asn Gly Thr Pro Val Asn Ala Thr Phe Glu Val Trp Lys Ala A3r. 

ATT GGT TGG GAG TAT GTT GCA TTT AGA ATA AAG ACC CCA ATC AAA GAG GGA 
He Gly Trp Glu Tyr Val Ala Phe Arg He Lys Thr Pro He Lys Glu Gly 



ACA GTG ACA ATT CCA TAC GGA GCA TIT ATA ACT GTT GCA GCC AAC ATT TCA 
Thr Val Thr lie Pro Tyr Gly Ala Phe Ue Ser val Ala Ala Asa lie Ser 

AGC TTA CCA AAT TAC ACA GAA CTT TAC TTA GAG GAC GTG GAG ATT GGA ACT 
Ser Leu Pro A*n Tyr Thr Glu Leu Tyr Leu Glu Asp Val Glu lie Gly Thr 

GAG TTT GGA ACQ CCA AGC ACT ACC TCC GCC CAC CTA GAG TC-G TC-G ATC ACA 
Glu Phe Gly Thr Pro Ser Thr Thr Ser Ala Kis Leu Glu Tr? Trp lie Thr 



AAC ATA ACA CTA ACT CCT CTA GAT AGA CCT CTT ATT TCC TAA 960
Asn Ile Thr Leu Thr Pro Leu Asp Arg Pro Leu Ile Ser End



(Vibrio harveyi)

ATG AGA OCT AAC ACG ATG AAG CAA AAA GCG CTA TAT CGA GCA GTA GCA ATG 
Mac Arg Oly Asn Thr Me= Lys Gin Lys Ala Leu Tyr Arg Ala Val Ala Mae 



CCT TTG ACT GGT CTT SCO AAC GTC GCA TCC GCT AAT GAG ATS GTA AAT CCT 
Gly Leu S«r Gly Hu Ala Asn Val Ala Ser Ala Asa Glu Me: Val AS= Sr= 

GAT GGT GGT GTC GTA GTC GGT TAG TGG CAT AAC TGG TGC GAT GGC C-CT GGT 
aso Gly Gly val Val Val Gly Tyr Tr? His Asa Trp cys Asp Gly Ala Gly 

TAC AAG GGA GGT AAT GCA CCG TGT GTA ACA TTG GAT GAA GTT GAT CCT ATG 
Tyr Lys Gly Gly Asa Ala Pro cys val Tfcr Leu Asp Glu Val As? Jra Mac 

TAC AAT GTG GTT AAC GTC TCC TTT ATG AAG GTA TTC AAT ACC ACT GAA GGT 
Tyr Asa val Val Asa V.l Ser She Met Ly. Val BH* Asa TtlT Ser Glu Giy 

CGT ATT CCA ACC TTT AAG CTC GAT CCA AAT ATC GGC CTT TCA GAA CAA CAA 
Arg He 9« Thr Pha Lys Lau Asp Pro Asn lis Gly Leu Ser Glu Gin Glr. 

TTT TTT GAC CAA ATT GAA GCT CTA AAC CAA CAA GGA CGT GCC GTT CTC ATC 
Ph* Sh« Asa Gin Zls Glu Ala Lau Asa Gla Gla Gly Arg Ala Val Leu lie 



OCT CTT GGT GGC GCA GAT GCT CAC GTT GAA CTT AGA ACT GGT GAC CAA CAA 
Ala Leu Gly Gly Ala As? Ala His Val Glu Leu Arg Thr Gly Asp Glu Gin 


GCG TTC GCA CAA GAG ATT ATT CGT TTA ACG GAT AAG TIC GGT TTT GAT GGT 
Ala Phe Ala Gin Glu lis 11* Arc Leu Thr Asp Lys Phe Gly Phe As? Gly 

CTA GAT ATC GAT TTA GAC- CAC- TCA GCA GTA ACG GCA GAG AAC AAC CAA ACT 
Leu Asp He Asp Lata Glu Gin Ser Ala val Thr Ala Glu Asa Asn Gin Thr 

GTA ATT CCA GCT GCA CTT CGC CTT GTA AAA GAG CAT TAT CAA CAA CAA GGT 
Val He Pro Ala Ala Leu Arc Leu val Lys Glu His Tyr Gin Gin Gla Gly 

AAG AAC TTC CTA ATT ACG ATG GCG CCT GAA TTC CCT TAT CTA ACA GAA GGT 
Lys Asn Phe Leu He Thr Ma= Ala Pro Glu Phe Pro Tyr Leu Thr Glu Gly 

GGC AAG TAT GTT CCT TAC ATT ACT GGT TTA GAA GGG TAC TAC GAT TGG ATC 
Gly Lys Tyr Val Pro Tyr IU Thr Gly Leu Glu Gly Tyr Tyr As? Trp tie 

AAC CCT CAG TTT TAC AAT CAA GGT GGT GAC GGT ATT TGG GTT GAT GGC GTG 
ash Pro Gla Phe Tyr Asn Gin Gly Gly Aso Gly He Trp Val As? Gly Val 

GGT TGG ATA GCG CAA AAC AAT GAT GAG TTA AAA CAA GAG TTT ATT TAC TAC 
Gly Trp lie Ala Gla Asn Asn Asp Glu Leu Lys Gla Glu Phe He Tyr Tyr 



ATT TCG GAC GCT CTA TCG AAC GGT ACA CGC GGT TTC CAC AAA ATC CCG CAT 
I La Ser Asp Ala Leu Ser Asn GLy Thr Arg GLy Phe His Lys lie Pro His 

GAC AAA CTG GTG TTT GGT AT" CCA TCT AAC ATT GAT GCT GCT GCA ACG GGC 
Asp Lys Leu Val Phe GLy lie Pro Ser Asu lie Asp Ala Ala Ala Thr Gly 

TTT GTT CAA AAC CCT CAA GAC CTT TAG GAC GCG TTT GAT CAA CTT. AAA GCG 
Phe Val GLa Asa Pro Gla Asp Leu Tyr Asp Ala Phe Asp Gla Leu Lys Ala 

CAA GCG CAG GCA CTT CGT GGC GTA ATG ACA TGG TCG GTG AAC TGG GAT ATG 
Gin, Gly GLti Ala Leu Arg Gly VaL Met Thr Trp Ser Val Asa Trp As? Me= 

GGC ACC GAT AAA AAT GGC CAA GCG TAC GGT GAA AAA TTC GTG AAG GAT TAG 
Gly Thr Asp Lys Asa Gly Gin Ala Tyr Gly Glu Lys Phe Val Lys Asp Tyr 

GGT CCG TTT ATC CAC GGG CAG ACT CCA CCA CCA AGT GAA GGT GAA CCA GTT 
Gly Pro Phe lie His Gly Gin Thr Pro Pro Pro Ser Glu Gly Glu Pro Val 

TTT AGT GGC CTC AAC GAT GTT CGT GTG CAT CAC GGT ACT TCA TTT GAC CCG 
Phe Ser Gly Lau Asn Asp Val Arg Val His His Gly Ser Ser Phe Asp Pro 

TAT GCA GGT GTT ACT GCG TCT GAT AAA GAA GAT GGA GAC CTA ACC AAC AGC 



Tyr Ala Gly Val Thr Ala Ser Asp Lys Glu Asp Gly Asp Leu Thr Asn Ser

ATC ACT GTC GAA GGT TCA , GTT GAT GTG AAC ACG GTA GGC ACA TAT GTT TTG 
II. Thr val Glu Gly sar Val Asp Val A S n Thr Val Gly Thr Tyr Val Leu 

GTT TAG AGT GTA AAA GAG AGC 'gAC AAC AAT GAA ACC AAG CAA AGT AGA ACG 
val Tyr Sar V.l Lys Mp S*r Asp Asn Asn Glu Thr Lys Gin Sar Arg Thr 

GTT GTT GTT TAG AGC CTA GTG CCT GAG TTT GAA GGT GTC GCA GAT ACG ACC 
val val val Tyr Ser Leu val Pro Glu Phe Glu Gly Val Ala Asp Thr Thr 

A TC CAG CTT GGT GAC GCT TTT GAC CCA ATG GCA GGC GTA AAA GCG ACC- GAT 
,U HU Leu Oly ^ Ala Pha Asp Pro «ec Ala Gly Val Lys Ala Thr Asp 

• GCA GAA GAC GGT GAT TTG ACT GAT CGG TAT CTA CGC CGC CTA AGG TCA CTT 
Ala- Glu Asp Gly Asp L.u Thr Asp Arg Tyr Leu Arg Arg Leu Arc S*r Lau 

CTG CGG TGC GAT AGC CTT CTG TGC CAT TTG GTG CAA CCG CCC AGT TTT CCA 
L eu Arg Cys Asp Ser Leu Lau Cys Hi, Leu Val Gin Pro Iro Sar Phe Pro 

GAC GCT CAA CGA TGG TTG CCA TCT CTT TCT GOT TCA 1514 
Asp Ala Gin Arg Trp Lau Pro Sar Leu 3er Gly END 
WO 97/44361



ENDOGLUCANASES 



PCT/US97/08793 



Field of the Invention 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production and isolation of such polynucleotides and polypeptides. More particularly, the 
polypeptides of the present invention have been identified as endoglucanases and, in particular,
as enzymes having carboxymethyl cellulase activity.

Background 

Cellulose, a fibrous, tough, water-insoluble substance, is found in the cell walls of
plants, particularly in stalks, stems, trunks and all the woody portions of plant tissues.
Cellulose constitutes much of the mass of wood, and cotton is almost pure cellulose. Because
cellulose is a linear, unbranched homopolysaccharide of 10,000 to 15,000 D-glucose units, it
resembles amylose and the main chains of glycogen. But there is a very important difference:
in cellulose, the glucose residues have the beta configuration, whereas in amylose,
amylopectin and glycogen, the glucose is in the alpha configuration. The glucose residues in
cellulose are linked by (beta 1->4) glycosidic bonds. This difference gives cellulose and
amylose very different 3-dimensional structures and physical properties.

Cellulose cannot be used by most animals as a source of stored fuel, because the (beta
1->4) linkages of cellulose are not hydrolyzed by alpha-amylases. Termites readily digest
cellulose, but only because their intestinal tract harbors a symbiotic microorganism,
Trichonympha, which secretes cellulase, an enzyme that hydrolyzes (beta 1->4) linkages
between glucose units. The only vertebrates able to use cellulose as food are cattle and other
ruminant animals (sheep, goats, camels and giraffes). The extra stomachs ("rumens") of these
animals teem with bacteria and protists that secrete cellulase.
The enzymatic hydrolysis of cellulose is considered to require the action of both
endoglucanases (1,4-beta-D-glucan glucanohydrolase) and exoglucanases (1,4-beta-D-glucan
cellobiohydrolase). A synergistic interaction of these enzymes is necessary for the complete
hydrolysis of crystalline cellulose (Caughlin, M.P., Genet. Eng. Rev., 3:39-109 (1985)). For
the complete degradation of cellulose (cellulose to glucose), beta-glucosidase might be required
if the "exo" enzyme does not release glucose. 1,4-beta-D-glucan glucohydrolase is another type
of "exo" cellulase.

Thermophilic bacteria have received considerable attention as sources of highly active
and thermostable cellulolytic and xylanolytic enzymes (Bronnenmeier, K. and Staudenbauer,
W.L., D.R. Woods (Ed.), The Clostridia and Biotechnology, Butterworth Publishers,
Stoneham, MA (1993)). Recently, the most extremely thermophilic organotrophic eubacteria
presently known have been isolated and characterized. These bacteria, which belong to the
genus Thermotoga, are fermentative microorganisms metabolizing a variety of carbohydrates
(Huber, R. and Stetter, K.O., in Ballows, et al., (Ed.), The Procaryotes, 2nd Ed.,
Springer-Verlag, New York, pgs. 3809-3819 (1992)).

In Huber et al., 1986, Arch. Microbiol. 144:324-333, the isolation of the bacterium
Thermotoga maritima is described. T. maritima is a eubacterium that is strictly anaerobic,
rod-shaped, fermentative and hyperthermophilic, and grows between 55°C and 90°C, with an
optimum growth temperature of about 80°C. This eubacterium has been isolated from 
geothermally heated sea floors in Italy and the Azores. T. maritima cells have a sheath-like 
structure and monotrichous flagellation. T. maritima is classified in the eubacterium kingdom 
by virtue of having murein and fatty acid-containing lipids, diphtheria-toxin-resistant 
elongation factor 2, an RNA polymerase subunit pattern, and sensitivity to antibiotics. 

Since, to date, most organisms identified from the archaeal domain are thermophiles or 
hyperthermophiles, archaeal bacteria are also considered a fertile source of thermophilic 
enzymes. 
The present invention provides polynucleotides and polypeptides encoded thereby
which have been identified as endoglucanase enzymes having carboxymethyl cellulase activity 
(CMC). 

In accordance with one aspect of the present invention, there are provided novel enzymes, as
well as active fragments, analogs and derivatives thereof.

In accordance with another aspect of the present invention, there are provided isolated 
nucleic acid molecules encoding enzymes of the present invention, including mRNAs, DNAs,
cDNAs and genomic DNAs, as well as active analogs and fragments of such enzymes.

In accordance with another aspect of the present invention there are provided isolated 
nucleic acid molecules encoding mature polypeptides expressed by the DNA contained in 
ATCC Deposit No. 97516. 

In accordance with yet a further aspect of the present invention, there is provided a 
process for producing such polypeptides by recombinant techniques comprising culturing
recombinant prokaryotic and/or eukaryotic host cells, containing a nucleic acid sequence 
encoding an enzyme of the present invention, under conditions promoting expression of said 
enzyme and subsequent recovery of said enzyme. 

In accordance with yet a further aspect of the present invention, there is provided a 
process for utilizing such enzymes, or polynucleotides encoding such enzymes, for degradation
of cellulose for the conversion of plant biomass into fuels and chemicals, and for use in
detergents, the textile industry, animal feed, waste treatment, and the fruit
juice/brewing industry for the clarification and extraction of juices.
In accordance with yet a further aspect of the present invention, there are also provided
nucleic acid probes comprising nucleic acid molecules of sufficient length to specifically
hybridize to a nucleic acid sequence of the present invention.

In accordance with yet a further aspect of the present invention, there is provided a 
process for utilizing such enzymes, or polynucleotides encoding such enzymes, for in vitro 
purposes related to scientific research, for example, to generate probes for identifying similar 
sequences which might encode similar enzymes from other organisms. 

These and other aspects of the present invention should be apparent to those skilled in 
the art from the teachings herein. 
BRIEF DESCRIPTION OF THE DRAWINGS

The following drawings are illustrative of embodiments of the invention and are not 
meant to limit the scope of the invention as encompassed by the claims. 

Figures 1A-IX show the nucleotide and deduced amino acid sequences of the enzymes of
the present invention. Sequencing was performed using a 378 automated DNA sequencer 
(Applied Biosystems, Inc.). 
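
By way of illustration only, the deduction of an amino acid sequence from a nucleotide sequence, as presented in the figures, may be sketched as follows. The sketch assumes the standard genetic code and a reading frame beginning at the first base; the function name `translate`, the "Xaa" placeholder for unreadable codons, and the three-letter output format are illustrative choices and do not form part of the original disclosure.

```python
# Illustrative sketch: deduce an amino acid sequence from a coding DNA sequence.
# Assumes the standard genetic code and a frame starting at the first base.
from itertools import product

BASES = "TCAG"
# One-letter residues for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "END",  # stop codon, labelled as in the figures
}


def translate(dna: str) -> list[str]:
    """Translate codons (whitespace is ignored) into three-letter residue codes."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        one_letter = CODON_TABLE.get(seq[i:i + 3])
        # Codons containing characters other than A, C, G, T become "Xaa".
        residues.append(THREE_LETTER.get(one_letter, "Xaa"))
        if one_letter == "*":
            break  # translation ends at the first stop codon
    return residues


if __name__ == "__main__":
    print(" ".join(translate("GTG GGA ATG AGG AAA TTC TTC ACG GAA GCC TGC GAG GAG TGA")))
```

Applied to a row of codons from a figure, such a routine can be used to cross-check the printed amino acid row against the nucleotide row.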
The term "gene" means the segment of DNA involved in producing a polypeptide
chain; it includes regions preceding and following the coding region (leader and trailer) as 
well as intervening sequences (introns) between individual coding segments (exons). 

A coding sequence is "operably linked to" another coding sequence when RNA 
polymerase will transcribe the two coding sequences into a single mRNA, which is then 
translated into a single polypeptide having amino acids derived from both coding sequences. 
The coding sequences need not be contiguous to one another so long as the expressed 
sequences are ultimately processed to produce the desired protein. 

"Recombinant" enzymes refer to enzymes produced by recombinant DNA techniques; 
i.e., produced from cells transformed by an exogenous DNA construct encoding the desired 
enzyme. "Synthetic" enzymes are those prepared by chemical synthesis. 

A DNA "coding sequence of" or a "nucleotide sequence encoding" a particular
enzyme is a DNA sequence which is transcribed and translated into an enzyme when placed
under the control of appropriate regulatory sequences. A "promoter sequence" is a DNA
regulatory region capable of binding RNA polymerase in a cell and initiating transcription of
a downstream (3' direction) coding sequence. The promoter is part of the DNA sequence.
This sequence region has a start codon at its 3' terminus. The promoter sequence
includes the minimum number of bases or elements necessary to initiate transcription at
levels detectable above background. However, after the RNA polymerase binds the sequence
and transcription is initiated at the start codon (3' terminus with a promoter), transcription
proceeds downstream in the 3' direction. Within the promoter sequence will be found a
transcription initiation site (conveniently defined by mapping with nuclease S1) as well as
protein binding domains (consensus sequences) responsible for the binding of RNA
polymerase.

The present invention provides purified thermostable enzymes that catalyze the
hydrolysis of the beta 1,4 glycosidic bonds in cellulose to thereby degrade cellulose. An
exemplary purified enzyme is an endoglucanase derived from an organism referred to herein as
"AEPIIla", which is a thermophilic archaeal bacterium with a very high temperature
optimum. The organism is strictly anaerobic, rod-shaped and fermentative, and grows
between 55 and 90°C (optimally at 85°C). AEPIIla was discovered in a shallow marine
hydrothermal area in Vulcano, Italy. The organism has coccoid cells occurring in singlets or
pairs. AEPIIla grows optimally at 85°C and pH 6.5 in a marine medium with cellulose as a
substrate and nitrogen in the gas phase. This exemplary enzyme is shown in Figure 1A, SEQ ID
NO:2.

The polynucleotide encoding SEQ ID NO:2 was originally recovered from a genomic 
gene library derived from AEPIIla as described below. It contains an open reading frame 
encoding a protein of 553 amino acid residues. 

In one embodiment, the endoglucanase enzyme of SEQ ID NO:2 of the present
invention has a molecular weight of about 60.9 kilodaltons, as measured by SDS-PAGE gel
electrophoresis and as inferred from the nucleotide sequence of the gene.
This purified enzyme may be used to catalyze the enzymatic degradation of cellulose where
desired. The endoglucanase enzyme of the present invention has a very high thermostability
and has the closest homology to endo-1,4-beta-glucanase from Xanthomonas campestris with
50% identity and 71% similarity at the amino acid level.
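
As a rough consistency check on the figures above, the size of a 553-residue protein can be estimated from an average residue mass. The sketch below assumes an average of about 110 Da per residue, which is a common rule of thumb and is not taken from this specification; the function names are illustrative only.

```python
# Rough molecular-weight estimate for a protein of known length.
# ASSUMPTION: ~110 Da average residue mass (rule of thumb, not from the specification).
AVG_RESIDUE_MASS_DA = 110.0


def estimate_mass_kda(num_residues: int) -> float:
    """Return an approximate molecular weight in kilodaltons."""
    return num_residues * AVG_RESIDUE_MASS_DA / 1000.0


def orf_length_nt(num_residues: int, include_stop: bool = True) -> int:
    """Return the open reading frame length in nucleotides."""
    codons = num_residues + (1 if include_stop else 0)
    return codons * 3


if __name__ == "__main__":
    print(f"{estimate_mass_kda(553):.1f} kDa")
    print(orf_length_nt(553), "nt including the stop codon")
```

Under this assumption the estimate falls within about 1 kDa of the stated value of about 60.9 kilodaltons; an exact figure would require summing the actual residue masses of SEQ ID NO:2.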

In accordance with an aspect of the present invention, there are provided isolated 
nucleic acid molecules (polynucleotides) which encode for the mature enzymes having the 
deduced amino acid sequence of Figure 1A-1X.

This invention, in addition to the isolated nucleic acid molecule encoding an 
endoglucanase enzyme disclosed in Figure 1 (SEQ ID NO:1), also provides substantially
similar sequences. Isolated nucleic acid sequences are substantially similar if: (i) they are
capable of hybridizing under stringent conditions, hereinafter described, to SEQ ID NO:1; or
(ii) they are DNA sequences which are degenerate to SEQ ID NO:1. Degenerate DNA
sequences encode the amino acid sequence of SEQ ID NO:2, but have variations in the 
nucleotide coding sequences. As used herein, "substantially similar" refers to the sequences 
having similar identity to the sequences of the instant invention. The nucleotide sequences 
that are substantially similar can be identified by hybridization or by sequence comparison. 
Enzyme sequences that are substantially similar can be identified by one or more of the 
following: proteolytic digestion, gel electrophoresis and/or microsequencing. 
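
The notion of a degenerate DNA sequence described above can be illustrated with a minimal sketch that translates two coding sequences with the standard genetic code and checks that they differ at the nucleotide level yet encode the same amino acids. The short sequences used here are illustrative codons only and are not quoted from the figures; the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (spaces allowed) into one-letter amino acids."""
    seq = dna.upper().replace(" ", "")
    if len(seq) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def is_degenerate_variant(reference: str, candidate: str) -> bool:
    """True if the two sequences differ in nucleotides but encode the same protein."""
    ref = reference.upper().replace(" ", "")
    cand = candidate.upper().replace(" ", "")
    return ref != cand and translate(ref) == translate(cand)


if __name__ == "__main__":
    # Illustrative codons: third-position changes that keep the amino acids unchanged.
    print(translate("ATG ATA AAC GTT"))
    print(is_degenerate_variant("ATG ATA AAC GTT", "ATG ATT AAT GTC"))
```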

One means for isolating a nucleic acid molecule encoding an endoglucanase enzyme is 
to probe a genomic gene library with a natural or artificially designed probe using art 
recognized procedures (see, for example: Current Protocols in Molecular Biology, Ausubel
F.M. et al. (Eds.), Greene Publishing Assoc. and Wiley-Interscience, New
York, 1989, 1992). It is appreciated by one skilled in the art that SEQ ID NO:1, or fragments
thereof (comprising at least 15 contiguous nucleotides), is a particularly useful probe. Other
particularly useful probes for this purpose are fragments hybridizable to the sequences of SEQ
ID NO:1 (i.e., comprising at least 15 contiguous nucleotides).

With respect to nucleic acid sequences which hybridize to specific nucleic acid 
sequences disclosed herein, hybridization may be carried out under conditions of reduced 
stringency, medium stringency or even stringent conditions. As an example of oligonucleotide 
hybridization, a polymer membrane containing immobilized denatured nucleic acid is first 
prehybridized for 30 minutes at 45°C in a solution consisting of 0.9 M NaCl, 50 mM
NaH2PO4, pH 7.0, 5.0 mM Na2EDTA, 0.5% SDS, 10X Denhardt's, and 0.5 mg/mL
polyriboadenylic acid. Approximately 2 X 10^7 cpm (specific activity 4-9 X 10^8 cpm/ug) of
32P end-labeled oligonucleotide probe are then added to the solution. After 12-16 hours of
incubation, the membrane is washed for 30 minutes at room temperature in 1X SET (150 mM
NaCl, 20 mM Tris hydrochloride, pH 7.8, 1 mM Na2EDTA) containing 0.5% SDS, followed
by a 30 minute wash in fresh 1X SET at Tm-10°C for the oligonucleotide probe. The
membrane is then exposed to autoradiographic film for detection of hybridization signals.
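
The wash temperature of Tm-10°C in the example above depends on the melting temperature of the particular oligonucleotide probe. The specification does not state how Tm is to be calculated; the sketch below assumes the simple Wallace rule (2°C per A or T, 4°C per G or C), which is an approximation generally applied only to short oligonucleotides. The probe sequence and function names are hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate Tm in degrees C with the Wallace rule: 2*(A+T) + 4*(G+C).

    ASSUMPTION: the Wallace rule is used for illustration only; the
    specification does not prescribe a Tm formula.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if not seq or at + gc != len(seq):
        raise ValueError("oligo must be a non-empty string of A, C, G, T")
    return 2 * at + 4 * gc


def stringent_wash_temp(oligo: str, offset_c: int = 10) -> int:
    """Return the final wash temperature, Tm minus the offset (10 degrees C in the text)."""
    return wallace_tm(oligo) - offset_c


if __name__ == "__main__":
    probe = "ATGATAAACGTTCCAACC"  # hypothetical 18-mer
    print(wallace_tm(probe), stringent_wash_temp(probe))
```
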

Stringent conditions means hybridization will occur only if there is at least 90%
identity, preferably at least 95% identity and most preferably at least 97% identity between
the sequences. See J. Sambrook et al., Molecular Cloning, A Laboratory Manual (2d Ed.
1989) (Cold Spring Harbor Laboratory) which is hereby incorporated by reference in its 
entirety. 

"Identity" as the term is used herein, refers to a polynucleotide sequence which 
comprises a percentage of the same bases as a reference polynucleotide (SEQ ID NO:1). For
example, a polynucleotide which is at least 90% identical to a reference polynucleotide, has 
polynucleotide bases which are identical in 90% of the bases which make up the reference 
polynucleotide and may have different bases in 10% of the bases which comprise that 
polynucleotide sequence. 
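
The definition of identity given above can be sketched as a simple position-by-position comparison. This minimal example assumes the two sequences are already aligned and of equal length, with no gaps; handling of gaps and the choice of alignment method are not addressed here, and the function name is illustrative.

```python
def percent_identity(reference: str, candidate: str) -> float:
    """Percentage of reference positions at which the candidate has the same base.

    ASSUMPTION: sequences are pre-aligned, ungapped, and of equal length.
    """
    ref = reference.upper()
    cand = candidate.upper()
    if not ref or len(ref) != len(cand):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(r == c for r, c in zip(ref, cand))
    return 100.0 * matches / len(ref)


if __name__ == "__main__":
    # One mismatch in ten bases corresponds to 90% identity.
    print(percent_identity("ATGATAAACG", "ATGATAAACC"))
```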

The present invention also relates to polynucleotides which differ from the reference 
polynucleotide such that the changes are silent changes, for example the changes do not alter 
the amino acid sequence encoded by the polynucleotide. The present invention also relates to 
nucleotide changes which result in amino acid substitutions, additions, deletions, fusions and 
truncations in the enzyme encoded by the reference polynucleotide (SEQ ID NO:1). In a
preferred aspect of the invention these enzymes retain the same biological action as the 
enzyme encoded by the reference polynucleotide. 

It is also appreciated that such probes can be and are preferably labeled with an 
analytically detectable reagent to facilitate identification of the probe. Useful reagents include 
but are not limited to radioactivity, fluorescent dyes or enzymes capable of catalyzing the 
formation of a detectable product. The probes are thus useful to isolate complementary copies 
of DNA from other animal sources or to screen such sources for related sequences. 

The present invention provides substantially pure endoglucanase enzymes. The term
"substantially pure" is used herein to describe a molecule, such as a polypeptide (e.g., an
endoglucanase polypeptide, or a fragment thereof) that is substantially free of other proteins,
lipids, carbohydrates, nucleic acids, and other biological materials with which it is naturally 
associated. For example, a substantially pure molecule, such as a polypeptide, can be at least 
60%, by dry weight, the molecule of interest. The purity of the polypeptides can be determined 
using standard methods including, e.g., polyacrylamide gel electrophoresis (e.g., SDS-PAGE), 
column chromatography (e.g., high performance liquid chromatography (HPLC)), and amino- 
terminal amino acid sequence analysis. 

Endoglucanase polypeptides included in the invention can have one of the amino acid 
sequences of endoglucanases shown in Figures 1A through 1X (SEQ ID NOs:2, 4, 6, 8, 10,
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, and 48), for example, the
amino acid sequence of AEPIIla (SEQ ID NO:2). Endoglucanase polypeptides, such as those
isolated from AEPIIla, can be characterized by catalyzing the hydrolysis of the beta 1,4
glycosidic bonds in cellulose. 

Also included in the invention are polypeptides having sequences that are 
"substantially identical" to the sequence of an endoglucanase polypeptide, such as one of SEQ 
ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46,
and 48, e.g., SEQ ID NO:2. A "substantially identical" amino acid sequence is a sequence 
that differs from a reference sequence only by conservative amino acid substitutions, for 
example, substitutions of one amino acid for another of the same class (e.g., substitution of 
one hydrophobic amino acid, such as isoleucine, valine, leucine, or methionine, for another, or 
substitution of one polar amino acid for another, such as substitution of arginine for lysine, 
glutamic acid for aspartic acid, or glutamine for asparagine), or by one or more non- 
conservative substitutions, deletions, or insertions, provided that the polypeptide retains at 
least one endoglucanase-specific activity or an endoglucanase-specific epitope. For example, 
one or more amino acids can be deleted from an endoglucanase polypeptide, resulting in 
modification of the structure of the polypeptide, without significantly altering its biological 
activity. For example, amino- or carboxyl-terminal amino acids that are not required for 
endoglucanase biological activity can be removed. Such modifications can result in the
development of smaller active endoglucanase polypeptides. 
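
The distinction drawn above between conservative and non-conservative substitutions can be sketched as follows. The residue classes are limited to the examples named in the text (isoleucine, valine, leucine and methionine as hydrophobic residues; arginine/lysine; glutamic acid/aspartic acid; glutamine/asparagine); treating every other pairing as non-conservative is a simplifying assumption of this sketch, and the function names are hypothetical.

```python
# Residue classes taken from the examples in the text (one-letter codes).
# ASSUMPTION: any pairing not listed here is reported as non-conservative.
CONSERVATIVE_GROUPS = (
    frozenset("IVLM"),  # hydrophobic: isoleucine, valine, leucine, methionine
    frozenset("RK"),    # arginine / lysine
    frozenset("ED"),    # glutamic acid / aspartic acid
    frozenset("QN"),    # glutamine / asparagine
)


def is_conservative(ref_aa: str, alt_aa: str) -> bool:
    """True if the two different residues fall in the same listed class."""
    if ref_aa == alt_aa:
        return False
    return any(ref_aa in g and alt_aa in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(reference: str, candidate: str):
    """List (position, reference residue, candidate residue, kind) for each difference.

    Positions are 1-based; sequences are assumed pre-aligned and of equal length.
    """
    if len(reference) != len(candidate):
        raise ValueError("sequences must be of equal length")
    changes = []
    pairs = zip(reference.upper(), candidate.upper())
    for pos, (r, c) in enumerate(pairs, start=1):
        if r != c:
            kind = "conservative" if is_conservative(r, c) else "non-conservative"
            changes.append((pos, r, c, kind))
    return changes


if __name__ == "__main__":
    print(classify_substitutions("MINVK", "LINVR"))
```

Such a classification addresses sequence only; whether a variant retains an endoglucanase-specific activity or epitope must still be determined by assay.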



Other endoglucanase polypeptides included in the invention are polypeptides having 
amino acid sequences that are at least 50% identical to the amino acid sequence of an 
endoglucanase polypeptide, such as any of the endoglucanases in SEQ ID NOs:2, 4, 6, 8, 10,
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, and 48, e.g., SEQ ID NO:2. 
The length of comparison in determining amino acid sequence homology can be, for example, 
at least 15 amino acids, for example, at least 20, 25, or 35 amino acids. Homology can be 
measured using standard sequence analysis software (e.g., Sequence Analysis Software 
Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 
1710 University Avenue, Madison, WI 53705; also see Ausubel, et al., supra). 

The invention also includes fragments of endoglucanase polypeptides that retain at 
least one endoglucanase-specific activity or epitope. Endoglucanase activity can be assayed
by examining the hydrolysis of beta 1,4 glycosidic bonds in cellulose. For example, an
endoglucanase polypeptide fragment containing, e.g., at least 8-10 amino acids can be used as 
an immunogen in the production of endoglucanase-specific antibodies. The fragment can 
contain, for example, an amino acid sequence that is conserved in endoglucanases, and this 
amino acid sequence can contain amino acids that are conserved in endoglucanases. Such 
fragments can easily be identified by comparing the sequences of endoglucanases found in 
Figures 1A-1X. In addition to their use as peptide immunogens, the above-described 
endoglucanase fragments can be used in immunoassays, such as ELISAs, to detect the 
presence of endoglucanase-specific antibodies in samples. 

The endoglucanase polypeptides of the invention can be obtained using any of several 
standard methods. For example, endoglucanase polypeptides can be produced in
standard recombinant expression systems (see below), chemically synthesized (this approach
may be limited to small endoglucanase peptide fragments), or purified from organisms in
which they are naturally expressed.



The invention also provides isolated nucleic acid molecules that encode the 
endoglucanase polypeptides described above, as well as fragments thereof. For example, 
nucleic acids that encode any of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26,
28, 30, 32, 36, 38, 40, 42, 44, 46, and 48, are included in the invention. These nucleic acids 
can contain naturally occurring nucleotide sequences, or sequences that differ from those of 
the naturally occurring nucleic acids that encode endoglucanases, but encode the same amino 
acids, due to the degeneracy of the genetic code. The nucleic acids of the invention can 
contain DNA or RNA nucleotides, or combinations or modifications thereof. Exemplary 
nucleic acids of the invention are shown in SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21,
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, and 47. 

By "isolated nucleic acid" is meant a nucleic acid, e.g., a DNA or RNA molecule, that
is not immediately contiguous with the 5' and 3' flanking sequences with which it normally is
immediately contiguous when present in the naturally occurring genome of the organism from 
which it is derived. The term thus describes, for example, a nucleic acid that is incorporated 
into a vector, such as a plasmid or viral vector; a nucleic acid that is incorporated into the 
genome of a heterologous cell (or the genome of a homologous cell, but at a site different 
from that at which it naturally occurs); and a nucleic acid that exists as a separate molecule, 
e.g., a DNA fragment produced by PCR amplification or restriction enzyme digestion, or an 
RNA molecule produced by in vitro transcription. The term also describes a recombinant 
nucleic acid that forms part of a hybrid gene encoding additional polypeptide sequences that 
can be used, for example, in the production of a fusion protein. 

The nucleic acid molecules of the invention can be used as templates in standard 
methods for production of endoglucanase gene products (e.g., endoglucanase RNAs and 
endoglucanase polypeptides). In addition, the nucleic acid molecules that encode
endoglucanase polypeptides (and fragments thereof) and related nucleic acids, such as
(1) nucleic acids containing sequences that are complementary to, or that hybridize to, nucleic
acids encoding endoglucanase polypeptides, or fragments thereof (e.g., fragments containing
at least 12, 15, 20, or 25 nucleotides); and (2) nucleic acids containing sequences that 
hybridize to sequences that are complementary to nucleic acids encoding endoglucanase 
polypeptides, or fragments thereof (e.g., fragments containing at least 12, 15, 20, or 25 
nucleotides); can be used in methods focused on their hybridization properties. For example, 
as is described in further detail below, such nucleic acid molecules can be used in the 
following methods: PCR methods for synthesizing endoglucanase nucleic acids, methods for 
detecting the presence of an endoglucanase nucleic acid in a sample, and screening methods for
identifying nucleic acids encoding new endoglucanase family members. 
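
The use of complementary fragments described above can be sketched computationally: a probe pairs with a target strand wherever the reverse complement of the probe occurs in that strand. The example below considers perfect matches only, which is a simplification, since actual hybridization tolerates mismatches depending on stringency. The minimum probe length of 12 nucleotides follows the text; the sequences and function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def probe_sites(target: str, probe: str, min_len: int = 12):
    """Return 0-based positions on the target strand where the probe pairs perfectly.

    ASSUMPTION: only exact complementarity is considered; mismatch-tolerant
    hybridization is not modeled.
    """
    if len(probe) < min_len:
        raise ValueError(f"probe must contain at least {min_len} nucleotides")
    site = reverse_complement(probe)
    strand = target.upper()
    last_start = len(strand) - len(site)
    return [i for i in range(last_start + 1) if strand[i:i + len(site)] == site]


if __name__ == "__main__":
    target = "GGATGATAAACGTTCCAACCGG"               # hypothetical target strand
    probe = reverse_complement("ATGATAAACGTT")      # 12-mer complementary to the target
    print(probe, probe_sites(target, probe))
```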

The invention also includes methods for identifying nucleic acid molecules that encode 
members of the endoglucanase polypeptide family in addition to SEQ ID NOs:2, 4, 6, 8,
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, and 48. In these 
methods, a sample, e.g., a nucleic acid library, such as a cDNA library, that contains a nucleic 
acid encoding an endoglucanase polypeptide is screened with an endoglucanase-specific probe, 
e.g., an endoglucanase-specific nucleic acid probe. Endoglucanase-specific nucleic acid 
probes are nucleic acid molecules (e.g., molecules containing DNA or RNA nucleotides, or 
combinations or modifications thereof) that specifically hybridize to nucleic acids encoding 
endoglucanase polypeptides, or to complementary sequences thereof. The term
"endoglucanase-specific probe," in the context of this method of the invention, refers to probes
that bind to nucleic acids encoding endoglucanase polypeptides, or to complementary 
sequences thereof, to a detectably greater extent than to nucleic acids encoding other enzymes, 
or to complementary sequences thereof. 

The invention facilitates production of endoglucanase-specific nucleic acid probes. 
Methods for obtaining such probes can be designed based on the amino acid sequences shown 
in Figure 1. The probes, which can contain at least 12, e.g., at least 15, 25, 35, 50, 100, or 
150 nucleotides, can be produced using any of several standard methods (see, e.g., Ausubel, 
et al., supra). For example, preferably, the probes are generated using PCR amplification
methods. In these methods, primers are designed that correspond to endoglucanase-conserved 
sequences (see Figure 1), which can include endoglucanase-specific amino acids, and the 
resulting PCR product is used as a probe to screen a nucleic acid library, such as a cDNA 
library. 
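
When primers are designed from conserved amino acid sequences as described above, the number of distinct nucleotide sequences that could encode a given peptide (its degeneracy) is a practical consideration, since lower degeneracy generally gives a simpler primer pool. The sketch below counts codons per residue using the standard genetic code and finds the least degenerate window of a peptide. The peptide shown is illustrative only, and the function names are hypothetical.

```python
from itertools import product
from math import prod

# Standard genetic code in T, C, A, G ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid to the list of codons that encode it.
CODONS_FOR = {}
for _codon, _aa in zip(product(BASES, repeat=3), AMINO):
    CODONS_FOR.setdefault(_aa, []).append("".join(_codon))


def degeneracy(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(CODONS_FOR[aa]) for aa in peptide.upper())


def least_degenerate_window(peptide: str, width: int):
    """Return (degeneracy, 0-based start) of the least degenerate window of given width."""
    if width < 1 or width > len(peptide):
        raise ValueError("width must be between 1 and the peptide length")
    windows = [
        (degeneracy(peptide[i:i + width]), i)
        for i in range(len(peptide) - width + 1)
    ]
    return min(windows)


if __name__ == "__main__":
    # Illustrative conserved peptide; a 5-residue window corresponds to a 15-nt primer.
    print(least_degenerate_window("WFGFETPNY", 5))
```

Windows rich in tryptophan or methionine, each encoded by a single codon, tend to give the least degenerate primers.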

In accordance with another aspect of the present invention, there is provided an 
isolated polynucleotide encoding an exemplary enzyme of the present invention (SEQ ID 
NO:1) which has been deposited with an appropriate depository for the deposit of biological
material. The deposited material is a pQET (Qiagen, Inc.) plasmid comprising the DNA of
Figure 1A. The deposit has been made with the American Type Culture Collection,
12301 Parklawn Drive, Rockville, Maryland 20852, USA, on April 22, 1996 and assigned 
ATCC Deposit No. 97516. 

The deposit has been made under the terms of the Budapest Treaty on the International 
Recognition of the deposit of micro-organisms for purposes of patent procedure. The strain 
will be irrevocably and without restriction or condition released to the public upon the 
issuance of a patent. The deposit is provided merely as a convenience to those of skill in the
art and is not an admission that a deposit is required under 35 U.S.C. §112. The sequences
of the polynucleotides contained in the deposited materials, as well as the amino acid 
sequences of the polypeptides encoded thereby, are controlling in the event of any conflict 
with any description of sequences herein. A license may be required to make, use or sell the 
deposited materials, and no such license is hereby granted. 

The coding sequences for the endoglucanase enzymes of the present invention were 
identified by preparing an AEPIIla genomic DNA library, for example, and screening the 
library for the clones having endoglucanase activity. Such methods for constructing a 
genomic gene library are well-known in the art. One means, for example, comprises shearing 
DNA isolated from AEPIIla by physical disruption. A small amount of the sheared DNA is 
checked on an agarose gel to verify that the majority of the DNA is in the desired size range 
(approximately 3-6 kb). The DNA is then blunt ended using Mung Bean Nuclease, incubated
at 37°C and phenol/chloroform extracted. The DNA is then methylated using Eco RI
Methylase. Eco RI linkers are then ligated to the blunt ends through the use of T4 DNA
ligase and incubation at 4°C. The ligation reaction is then terminated and the DNA is cut
back with Eco RI restriction enzyme. The DNA is then size fractionated on a sucrose
gradient following procedures known in the art, for example, Maniatis, T., et al., Molecular
Cloning, Cold Spring Harbor Press, New York, 1982, which is hereby incorporated by
reference in its entirety.

A plate assay is then performed to get an approximate concentration of the DNA. 
Ligation reactions are then performed and 1 µl of the ligation reaction is packaged to
construct a library. Packaging, for example, may occur through the use of purified λgt11
phage arms cut with EcoRI and DNA cut with EcoRI after attaching EcoRI linkers. The
DNA and λgt11 arms are ligated with DNA ligase. The ligated DNA is then packaged into
infectious phage particles. The packaged phages are used to infect E. coli cultures and the 
infected cells are spread on agar plates to yield plates carrying thousands of individual phage 
plaques. The library is then amplified. 

In a preferred embodiment, the enzyme of the present invention was isolated from an
AEPIIla library by the following technique:



1. The λgt11 AEPIIla library was plated onto 6 LB/GelRite/0.1% CMC/NZY agar plates
(~4,800 plaque forming units/plate) in E. coli Y1090 host with LB agarose containing 1 mM
IPTG as top agarose. The plates were incubated at 37°C overnight.

2. Plates were chilled at 4°C for one hour.

3. The plates were overlayed with Duralon membranes (Stratagene) at room 
temperature for one hour and the membranes were oriented and lifted off the plates and stored 
at 4°C. 

4. The top agarose layer was removed and plates were incubated at 72°C for ~3
hours.

5. The plate surface was rinsed with NaCl. 

6. The plate was stained with 0.1% Congo Red for 15 minutes. 

7. The plate was destained with 1M NaCl. 

8. The putative positives identified on the plate were isolated from the Duralon
membrane (positives are identified by clearing zones around clones). The phage was eluted
from the membrane by incubating in 500 µl SM + 25 µl CHCl3.

9. Insert DNA was subcloned into pBluescript II SK(+) cloning vector 
(Stratagene), and subclones were reassayed for CMCase activity using the following protocol: 

i) Spin 1 ml overnight miniprep of clone at maximum speed for 3 minutes.

ii) Decant the supernatant and use it to fill "wells" that have been made in an 
LB/GelRite/0.1% CMC plate. 

iii) Incubate at 72°C for 2 hours. 

iv) Stain with 0.1% Congo Red for 15 minutes. 

v) Destain with 1M NaCl for 15 minutes. 

vi) Identify positives by clearing zone around clone. 
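
The scale of the primary screen in step 1 above can be put in perspective with a short calculation. The total number of plaques follows directly from the stated figures (6 plates at about 4,800 plaque forming units each). The probability that any given genomic region is represented is estimated here with the Clarke-Carbon relation P = 1 - (1 - f)^N, which is a standard library-coverage estimate and is not taken from this specification. The genome size of 2,000,000 bp is a purely hypothetical value chosen for illustration, as is the assumption that every plaque carries an insert at the lower end of the 3-6 kb range.

```python
def total_plaques(plates: int, pfu_per_plate: int) -> int:
    """Total plaque forming units screened across all plates."""
    return plates * pfu_per_plate


def coverage_probability(n_clones: int, insert_bp: int, genome_bp: int) -> float:
    """Clarke-Carbon estimate: P = 1 - (1 - f)^N, with f = insert_bp / genome_bp."""
    if not 0 < insert_bp <= genome_bp:
        raise ValueError("insert size must be positive and no larger than the genome")
    fraction = insert_bp / genome_bp
    return 1.0 - (1.0 - fraction) ** n_clones


if __name__ == "__main__":
    n = total_plaques(plates=6, pfu_per_plate=4800)
    # ASSUMPTION: hypothetical 2,000,000 bp genome and 3,000 bp inserts.
    p = coverage_probability(n, insert_bp=3000, genome_bp=2_000_000)
    print(n, f"{p:.4f}")
```
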

Fragments of the full length gene of the present invention may be used as a
hybridization probe for a cDNA or a genomic library to isolate the full length DNA and to
isolate other DNAs which have a high sequence similarity to the gene or similar biological
activity. Probes of this type have at least 10, preferably at least 15, and even more preferably
at least 30 bases and may contain, for example, at least 50 or more bases. The probe may
also be used to identify a DNA clone corresponding to a full length transcript and a genomic
clone or clones that contain the complete gene including regulatory and promoter regions,
exons, and introns.

The isolated nucleic acid sequences and other enzymes may then be measured for
retention of biological activity characteristic to the enzyme of the present invention, for 
example, in an assay for detecting enzymatic endoglucanase activity. Such enzymes include 
truncated forms of endoglucanase, and variants such as deletion and insertion variants. 

Examples of such assays include an assay for the detection of endoglucanase activity 
based on specific interaction of direct dyes such as Congo red with polysaccharides. This 
colorant reacts with beta-1,4-glucans causing a visible red shift (Wood, P.J., Carbohydr. Res.,
85:271 (1980) and Wood, P.J., Carbohydr. Res., 94:C19 (1981)). The preferred substrate for
the test is carboxymethylcellulose (CMC) which can be obtained from different sources
(Hercules Inc., Wilmington, DE, Type 4M6F or Sigma Chemical Company, St. Louis, MO,
Medium Viscosity). The CMC is incorporated as the main carbon source into a minimal
agar medium in quantities of 0.1-1.0%. The microorganisms can be screened directly on 
these plates, but the replica plating technique from a master plate is preferable since the 
visualization of the activity requires successive flooding with the reagents, which would 
render the reisolation of active colonies difficult. Such endoglucanase-producing colonies are 
detectable after a suitable incubation time (1-3 days depending on the growth), by flooding 
the plate with 10 ml of a 0.1% aqueous solution of Congo Red. The coloration is terminated 
after 20 minutes by pouring off the dye and flooding the plates with 10 ml of,5M NaCl 
solution (commercial salt can be used). After an additional 20 minutes, the salt solution is 
discarded and endoglucanase activity is revealed by a pale-orange zone around the active 
microorganisms. In some cases, these zones can be enhanced by treating the plates with 1 N 
acetic acid, causing the dye to change its color to blue. 

The same technique can be used as a cup-plate diffusion assay with excellent
sensitivity for the determination of endoglucanase activity in culture filtrates or during enzyme
purification steps (Carder, J.H., Anal. Biochem., 153:75 (1986)). See generally, Methods for
Measuring Cellulase Activities, Methods in Enzymology, Vol. 160, pgs. 87-116. 

The enzyme of the present invention has enzymatic activity with respect to the 
hydrolysis of the beta 1,4 glycosidic bonds in carboxymethylcellulose, since the halos 
discussed above are shown around the colonies. 



The polynucleotide of the present invention may be in the form of DNA which DNA 
includes cDNA, genomic DNA, and synthetic DNA. The DNA may be double-stranded or 
single-stranded, and if single stranded may be the coding strand or non-coding (anti-sense) 
strand. The coding sequence which encodes the mature enzyme may be identical to the 
coding sequences shown in Figure 1 and/or that of the deposited clone (SEQ ID NO:1), or
may be a different coding sequence which coding sequence, as a result of the redundancy or
degeneracy of the genetic code, encodes the same mature enzyme as the DNA of Figure 1
(e.g., SEQ ID NO:1).



The polynucleotide which encodes for the mature enzyme of Figure 1 (e.g., SEQ ID 
NO:2) may include, but is not limited to: only the coding sequence for the mature enzyme; 
the coding sequence for the mature enzyme and additional coding sequence such as a leader 
sequence or a proprotein sequence; the coding sequence for the mature enzyme (and 
optionally additional coding sequence) and non-coding sequence, such as introns or non- 
coding sequence 5' and/or 3' of the coding sequence for the mature enzyme. 



Thus, the term "polynucleotide encoding an enzyme (protein)" encompasses a 
polynucleotide which includes only coding sequence for the enzyme as well as a 
polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention further relates to variants of the hereinabove described
polynucleotides which encode for fragments, analogs and derivatives of the enzyme having the
deduced amino acid sequence of Figure 1 (e.g., SEQ ID NO:2). The variant of the
polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. 

Thus, the present invention includes polynucleotides encoding the same mature enzyme 
as shown in Figure 1 as well as variants of such polynucleotides which variants encode for a 
fragment, derivative or analog of the enzyme of Figure 1. Such nucleotide variants include 
deletion variants, substitution variants and addition or insertion variants. 

As hereinabove indicated, the polynucleotide may have a coding sequence which is a 
naturally occurring allelic variant of the coding sequence shown in Figure 1. As known in the 
art, an allelic variant is an alternate form of a polynucleotide sequence which may have a 
substitution, deletion or addition of one or more nucleotides, which does not substantially alter 
the function of the encoded enzyme. 

The present invention also includes polynucleotides, wherein the coding sequence for 
the mature enzyme may be fused in the same reading frame to a polynucleotide sequence 
which aids in expression and secretion of an enzyme from a host cell, for example, a leader 
sequence which functions to control transport of an enzyme from the cell. The enzyme 
having a leader sequence is a preprotein and may have the leader sequence cleaved by the 
host cell to form the mature form of the enzyme. The polynucleotides may also encode for a 
proprotein which is the mature protein plus additional 5' amino acid residues. A mature 
protein having a prosequence is a proprotein and is an inactive form of the protein. Once the 
prosequence is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may encode for a 
mature enzyme, or for an enzyme having a prosequence or for an enzyme having both a 
prosequence and a presequence (leader sequence). 

The present invention further relates to polynucleotides which hybridize to the
hereinabove-described sequences if there is at least 70%, preferably at least 90%, and more 
preferably at least 95% identity between the sequences. The present invention particularly 
relates to polynucleotides which hybridize under stringent conditions to the hereinabove- 
described polynucleotides. As herein used, the term "stringent conditions" means 
hybridization will occur only if there is at least 95% and preferably at least 97% identity 
between the sequences. The polynucleotides which hybridize to the hereinabove described 
polynucleotides in a preferred embodiment encode enzymes which retain substantially
the same biological function or activity as the mature enzyme encoded by the DNA of
Figure 1.



Alternatively, the polynucleotide may have at least 15 bases, preferably at least 30 
bases, and more preferably at least 50 bases which hybridize to a polynucleotide of the 
present invention and which has an identity thereto, as hereinabove described, and which may 
or may not retain activity. For example, such polynucleotides may be employed as probes for 
the polynucleotide of SEQ ID NO:1, for example, for recovery of the polynucleotide or as a
PCR primer. 



Thus, the present invention is directed to polynucleotides having at least a 70% 
identity, preferably at least 90% identity and more preferably at least a 95% identity to a 
polynucleotide which encodes the enzyme of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, or 48 as well as fragments thereof, which 
fragments have at least 30 bases and preferably at least 50 bases and to enzymes encoded by 
such polynucleotides. 

The present invention further relates to an enzyme which has the deduced amino acid
sequence of Figure 1, as well as fragments, analogs and derivatives of such enzyme. 

The terms "fragment," "derivative" and "analog" when referring to the enzyme of
Figure 1 mean an enzyme which retains essentially the same biological function or activity as
such enzyme. Thus, an analog includes a proprotein which can be activated by cleavage of
the proprotein portion to produce an active mature enzyme. 



The enzyme of the present invention may be a recombinant enzyme, a natural enzyme 
or a synthetic enzyme, preferably a recombinant enzyme. 

The fragment, derivative or analog of the enzyme of Figure 1 may be (i) one in 
which one or more of the amino acid residues are substituted with a conserved or non- 
conserved amino acid residue (preferably a conserved amino acid residue) and such substituted 
amino acid residue may or may not be one encoded by the genetic code, or (ii) one in which 
one or more of the amino acid residues includes a substituent group, or (iii) one in which the 
mature enzyme is fused with another compound, such as a compound to increase the half-life 
of the enzyme (for example, polyethylene glycol), or (iv) one in which the additional amino 
acids are fused to the mature enzyme, such as a leader or secretory sequence or a sequence 
which is employed for purification of the mature enzyme or a proprotein sequence. Such 
fragments, derivatives and analogs are deemed to be within the scope of those skilled in the 
art from the teachings herein. 

The enzymes and polynucleotides of the present invention are preferably provided in 
an isolated form, and preferably are purified to homogeneity. 

The term "isolated" means that the material is removed from its original environment 
(e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or enzyme present in a living animal is not isolated, but the same 
polynucleotide or enzyme, separated from some or all of the coexisting materials in the 
natural system, is isolated. Such polynucleotides could be part of a vector and/or such 
polynucleotides or enzymes could be part of a composition, and still be isolated in that such 
vector or composition is not part of its natural environment. 

The enzymes of the present invention include an enzyme of Figure 1A-1X (in 
particular the mature enzyme) as well as enzymes which have at least 70% similarity 
(preferably at least 70% identity) to an enzyme of Figure 1A-1X and more preferably at least 
90% similarity (more preferably at least 90% identity) to an enzyme of Figure 1A-1X and 
still more preferably at least 95% similarity (still more preferably at least 95% identity) to an 
enzyme of Figure 1A-1X and also include portions of such enzymes with such portion of the 
enzyme generally containing at least 30 amino acids and more preferably at least 50 amino 
acids. 

As known in the art, "similarity" between two enzymes is determined by comparing the 
amino acid sequence and its conserved amino acid substitutes of one enzyme to the sequence 
of a second enzyme. Similarity may be determined by procedures which are well-known in 
the art, for example, a BLAST program (Basic Local Alignment Search Tool at the National 
Center for Biotechnology Information). 
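
The identity thresholds recited above (70%, 90% and 95%) can be illustrated with a short calculation. The following is only a minimal sketch and is not the BLAST procedure: it assumes the two sequences have already been aligned to equal length with "-" marking gaps, and it counts identity over ungapped columns only. The function names and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over alignment columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: gapped columns are excluded from the count
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def highest_threshold_met(pct: float):
    """Return the highest of the recited thresholds (95, 90, 70) that pct meets, or None."""
    for threshold in (95, 90, 70):
        if pct >= threshold:
            return threshold
    return None


# Hypothetical ten-residue sequences differing at two positions.
pct = percent_identity("MINVATGEET", "MINVPTGEEA")
print(pct, highest_threshold_met(pct))
```

A production comparison would normally use an alignment program and a scoring matrix rather than a fixed pre-alignment; the sketch only shows how a percentage maps onto the stated tiers.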

A variant, i.e. a "fragment", "analog" or "derivative" enzyme, and reference enzyme 
may differ in amino acid sequence by one or more substitutions, additions, deletions, fusions 
and truncations, which may be present in any combination. 

Among preferred variants are those that vary from a reference by conservative amino 
acid substitutions. Such substitutions are those that substitute a given amino acid in a 
polypeptide by another amino acid of like characteristics. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, 
Leu and Ile; interchange of the hydroxyl residues Ser and Thr; exchange of the acidic residues 
Asp and Glu; substitution between the amide residues Asn and Gln; exchange of the basic 
residues Lys and Arg; and replacements among the aromatic residues Phe and Tyr. 
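
The groupings listed above can be written down as a lookup table. The following is a minimal sketch, using one-letter amino acid codes, that classifies a single substitution as identical, conservative or non-conservative according to those groupings only; the group labels and function name are illustrative choices, not terms defined in this specification.

```python
# Conservative groups exactly as listed in the text, in one-letter code.
CONSERVATIVE_GROUPS = {
    "aliphatic": {"A", "V", "L", "I"},  # Ala, Val, Leu, Ile
    "hydroxyl": {"S", "T"},             # Ser, Thr
    "acidic": {"D", "E"},               # Asp, Glu
    "amide": {"N", "Q"},                # Asn, Gln
    "basic": {"K", "R"},                # Lys, Arg
    "aromatic": {"F", "Y"},             # Phe, Tyr
}


def substitution_class(reference: str, variant: str) -> str:
    """Classify a single-residue substitution against the groups above."""
    reference, variant = reference.upper(), variant.upper()
    if reference == variant:
        return "identical"
    for name, members in CONSERVATIVE_GROUPS.items():
        if reference in members and variant in members:
            return f"conservative ({name})"
    return "non-conservative"


print(substitution_class("L", "I"))  # both aliphatic
print(substitution_class("K", "D"))  # basic to acidic
```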

Most highly preferred are variants which retain the same biological function and 
activity as the reference polypeptide from which they vary. 

Fragments or portions of the enzymes of the present invention may be employed for 
producing the corresponding full-length enzyme by peptide synthesis; therefore, the fragments 
may be employed as intermediates for producing the full-length enzymes. Fragments or 
portions of the polynucleotides of the present invention may be used to synthesize full-length 
polynucleotides of the present invention. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of enzymes of the invention by recombinant techniques. 

Host cells are genetically engineered (transduced or transformed or transfected) with 
the vectors containing the polynucleotides of this invention. Such vectors may be, for 
example, a cloning vector or an expression vector. The vector may be, for example, in the 
form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be cultured in 
conventional nutrient media modified as appropriate for activating promoters, selecting 
transformants or amplifying the genes of the present invention. The culture conditions, such 
as temperature, pH and the like, are those previously used with the host cell selected for 
expression, and will be apparent to the ordinarily skilled artisan. 

The polynucleotides of the present invention may be employed for producing enzymes 
by recombinant techniques. Thus, for example, the polynucleotide may be included in any one 
of a variety of expression vectors for expressing an enzyme. Such vectors include 
chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; 
bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from 
combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox 
virus, and pseudorabies. However, any other vector may be used as long as it is replicable 
and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such procedures and others are deemed 
to be within the scope of those skilled in the art. 



The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli 
lac or trp, the phage lambda PL promoter and other promoters known to control expression of 
genes in prokaryotic or eukaryotic cells or their viruses. The expression vector also contains 
a ribosome binding site for translation initiation and a transcription terminator. The vector 
may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

The vector containing the appropriate DNA sequence as hereinabove described, as well 
as an appropriate promoter or control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. 

As representative examples of appropriate hosts, there may be mentioned: bacterial 
cells, such as E. coli, Streptomyces, Bacillus subtilis; fungal cells, such as yeast; insect cells 
such as Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, COS or Bowes 
melanoma; adenoviruses; plant cells, etc. The selection of an appropriate host is deemed to 
be within the scope of those skilled in the art from the teachings herein. 

More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters 
are known to those of skill in the art, and are commercially available. The following vectors 
are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBluescript II 
(Stratagene); pTRC99a, pKK223-3, pDR540, pRIT2T (Pharmacia). Eukaryotic: pXT1, pSG5 
(Stratagene); pSVK3, pBPV, pMSG, pSVLSV40 (Pharmacia). However, any other plasmid or 
vector may be used as long as it is replicable and viable in the host. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lacI, lacZ, T3, T7, gpt, 
lambda PR, PL and trp. Eukaryotic promoters include CMV immediate early, HSV thymidine 
kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention relates to host cells containing the 
above-described constructs. The host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be 
effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, or 
electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular Biology, 
(1986)). 

The constructs in host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. Alternatively, the enzymes of the invention 
can be synthetically produced by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells 
under the control of appropriate promoters. Cell-free translation systems can also be 
employed to produce such proteins using RNAs derived from the DNA constructs of the 
present invention. Appropriate cloning and expression vectors for use with prokaryotic and 
eukaryotic hosts are described by Sambrook et al., Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby 
incorporated by reference. 



Transcription of the DNA encoding the enzymes of the present invention by higher 
eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DNA, usually about from 10 to 300 bp that act on a promoter to increase 
its transcription. Examples include the SV40 enhancer on the late side of the replication origin 
bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 



Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed 
gene to direct transcription of a downstream structural sequence. Such promoters can be 
derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), 
α-factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural 
sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably, a leader sequence capable of directing secretion of translated 
enzyme. Optionally, the heterologous sequence can encode a fusion enzyme including an N- 
terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 



Useful expression vectors for bacterial use are constructed by inserting a structural 
DNA sequence encoding a desired protein together with suitable translation initiation and 
termination signals in operable reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to ensure 
maintenance of the vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimurium 
and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, 
although others may also be employed as a matter of choice. 



As a representative but nonlimiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of the well known cloning 
vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 
(Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI, 
USA). These pBR322 "backbone" sections are combined with an appropriate promoter and 
the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to an 
appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents; such methods are well known to those skilled in the art. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey 
kidney fibroblasts, described by Gluzman, Cell, 23:175 (1981), and other cell lines capable of 
expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor 
and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed 
sequences. DNA sequences derived from the SV40 splice, and polyadenylation sites may be 
used to provide the required nontranscribed genetic elements. 

The enzyme can be recovered and purified from recombinant cell cultures by methods 
including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin chromatography. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
Finally, high performance liquid chromatography (HPLC) can be employed for final 
purification steps. 

The enzymes of the present invention may be a naturally purified product, or a product 
of chemical synthetic procedures, or produced by recombinant techniques from a prokaryotic 
or eukaryotic host (for example, by bacterial, yeast, higher plant, insect and mammalian cells 
in culture). Depending upon the host employed in a recombinant production procedure, the 
enzymes of the present invention may be glycosylated or may be non-glycosylated. Enzymes 
of the invention may or may not also include an initial methionine amino acid residue. 

The enzyme of this invention may be employed for any purpose in which such enzyme 
activity is necessary or desired. In a preferred embodiment the enzyme is employed for 
catalyzing the hydrolysis of cellulose. The degradation of cellulose may be used for the 
conversion of plant biomass into fuels and chemicals. 

The enzyme of the present invention may also be employed in the detergent and textile 
industry, in the production of animal feed, in waste treatment and in the fruit juice/brewing 
industry for the clarification and extraction of juices. 

In a preferred embodiment, the enzyme of the present invention is a thermostable 
enzyme which is stable to heat and is heat resistant and catalyzes the enzymatic hydrolysis of 
cellulose, i.e., the enzyme is able to renature and regain activity after a brief (i.e., 5 to 30 
seconds), or longer period, for example, minutes or hours, exposure to temperatures of 80°C 
to 105°C and has a temperature optimum above 60°C. 

The enzymes, their fragments or other derivatives, or analogs thereof, or cells 
expressing them can be used as an immunogen to produce antibodies thereto. These 
antibodies can be, for example, polyclonal or monoclonal antibodies. The present invention 
also includes chimeric, single chain, and humanized antibodies, as well as Fab fragments, or 
the product of an Fab expression library. Various procedures known in the art may be used 
for the production of such antibodies and fragments. 

Antibodies generated against the enzymes corresponding to a sequence of the present 
invention can be obtained by direct injection of the enzymes into an animal or by 
administering the enzymes to an animal, preferably a nonhuman. The antibody so obtained 
will then bind the enzymes themselves. In this manner, even a sequence encoding only a fragment 
of the enzymes can be used to generate antibodies binding the whole native enzymes. Such 
antibodies can then be used to isolate the enzyme from cells expressing that enzyme. 

For preparation of monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein, 1975, Nature, 256:495-497), the trioma technique, the human 
B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole, et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to immunogenic enzyme 
products of this invention. Also, transgenic mice may be used to express humanized 
antibodies to immunogenic enzyme products of this invention. 

Antibodies generated against the enzyme of the present invention may be used in 
screening for similar enzymes from other organisms and samples. Such screening techniques 
are known in the art; for example, one such screening assay is described in "Methods for 
Measuring Cellulase Activities", Methods in Enzymology, Vol 160, pp. 87-116, which is 
hereby incorporated by reference in its entirety. Antibodies may also be employed as a probe 
to screen gene libraries generated from this or other organisms to identify this or cross 
reactive activities. 



Isolation and purification of polypeptides produced in the systems described above can 
be carried out using conventional methods, appropriate for the particular system. For example, 
preparative chromatography and immunological separations employing antibodies, such as 
monoclonal or polyclonal antibodies, can be used. 



The term "antibody," as used herein, refers to intact immunoglobulin molecules, as well 
as fragments of immunoglobulin molecules, such as Fab, Fab', (Fab')2, Fv, and SCA fragments, 
that are capable of binding to an epitope of an endoglucanase polypeptide. These antibody 
fragments, which retain some ability to selectively bind to the antigen (e.g., an endoglucanase 
antigen) of the antibody from which they are derived, can be made using well known methods in 
the art (see, e.g., Harlow and Lane, supra), and are described further, as follows. 

(1) A Fab fragment consists of a monovalent antigen-binding fragment of an antibody molecule, 
and can be produced by digestion of a whole antibody molecule with the enzyme papain, to yield 
a fragment consisting of an intact light chain and a portion of a heavy chain. 

(2) A Fab' fragment of an antibody molecule can be obtained by treating a whole antibody 
molecule with pepsin, followed by reduction, to yield a molecule consisting of an intact light 
chain and a portion of a heavy chain. Two Fab' fragments are obtained per antibody molecule 
treated in this manner. 



(3) A (Fab')2 fragment of an antibody can be obtained by treating a whole antibody molecule 
with the enzyme pepsin, without subsequent reduction. A (Fab')2 fragment is a dimer of two Fab' 
fragments, held together by two disulfide bonds. 

(4) An Fv fragment is defined as a genetically engineered fragment containing the variable 
region of a light chain and the variable region of a heavy chain expressed as two chains. 



(5) A single chain antibody ("SCA") is a genetically engineered single chain molecule 
containing the variable region of a light chain and the variable region of a heavy chain, linked by 
a suitable, flexible polypeptide linker. 

As used in this invention, the term "epitope" refers to an antigenic determinant on an 
antigen, such as an endoglucanase polypeptide, to which the paratope of an antibody, such as an 
endoglucanase-specific antibody, binds. Antigenic determinants usually consist of chemically 
active surface groupings of molecules, such as amino acids or sugar side chains, and can have 
specific three-dimensional structural characteristics, as well as specific charge characteristics. 

As is mentioned above, antigens that can be used in producing endoglucanase-specific 
antibodies include endoglucanase polypeptides, e.g., any of the endoglucanases shown in Figures 
1A-1X, or polypeptide fragments thereof. The polypeptide or peptide used to immunize an animal can be 
obtained by standard recombinant, chemical synthetic, or purification methods. As is well 
known in the art, in order to increase immunogenicity, an antigen can be conjugated to a carrier 
protein. Commonly used carriers include keyhole limpet hemocyanin (KLH), thyroglobulin, 
bovine serum albumin (BSA), and tetanus toxoid. The coupled peptide is then used to immunize 
the animal (e.g., a mouse, a rat, or a rabbit). In addition to such carriers, well known adjuvants 
can be administered with the antigen to facilitate induction of a strong immune response. 

Endoglucanase-specific polyclonal and monoclonal antibodies can be purified, for 
example, by binding to, and elution from, a matrix containing an endoglucanase polypeptide, 
e.g., the endoglucanase polypeptide (or fragment thereof) to which the antibodies were raised. 
Additional methods for antibody purification and concentration are well known in the art and can 
be practiced with the endoglucanase-specific antibodies of the invention (see, for example, 
Coligan, et al., Unit 9, Current Protocols in Immunology, Wiley Interscience, 1994). 

Anti-idiotype antibodies corresponding to endoglucanase-specific antigens are also 
included in the invention, and can be produced using standard methods. These antibodies are 
raised to endoglucanase-specific antibodies, and thus mimic endoglucanase-specific epitopes. 

The members of a pair of molecules (e.g., an antibody-antigen pair or a nucleic acid 
pair) are said to "specifically bind" to each other if they bind to each other with greater 
affinity than to other, non-specific molecules. For example, an antibody raised against an 
antigen to which it binds more efficiently than to a non-specific protein can be described as 
specifically binding to the antigen. (Similarly, a nucleic acid probe can be described as 
specifically binding to a nucleic acid target if it forms a specific duplex with the target by 
base pairing interactions (see above).) 

The present invention is further described with reference to the following examples; 
however, it is to be understood that the present invention is not limited to such examples. All 
parts or amounts, unless otherwise specified, are by weight. 

In one aspect of the invention, a method for producing an endoglucanase enzyme, such 
as those shown in Figures 1A-1X, is provided. The method includes growing a host cell 
which contains a polynucleotide encoding the enzyme (e.g., SEQ ID NO:1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, or 47), under conditions 
which allow the expression of the nucleic acid, and isolating the enzyme encoded by the 
nucleic acid. Methods of culturing the host cell are described in the Examples and are known 
by those of skill in the art. 

In another embodiment, the invention provides a method for degrading 
carboxymethylcellulose. The method includes contacting carboxymethylcellulose with a degrading 
effective amount of an enzyme of the invention, such as the enzyme shown in SEQ ID NO:2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, or 48. The 
term "degrading effective" amount refers to the amount of enzyme which is required to 
degrade at least 50% of the carboxymethylcellulose, as compared to carboxymethylcellulose 
not contacted with the enzyme. Preferably, at least 80% of the carboxymethylcellulose is 
degraded. 



In another embodiment, the invention provides a method for hydrolyzing the beta 1,4 
glycosidic bond in cellulose, the method including administering an effective amount of an 
enzyme of the invention (e.g., SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 36, 38, 40, 42, 44, 46, or 48) to cellulose, to hydrolyze the glycosidic bond. An 
"effective" amount refers to the amount of enzyme which is required to hydrolyze at least 
50% of the glycosidic bonds, as compared to carboxymethylcellulose not contacted with the 
enzyme. Preferably, at least 80% of the glycosidic bonds are hydrolyzed. 
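
The 50% and 80% criteria above are relative to an untreated control, which can be shown as simple arithmetic. The following is a minimal sketch: it assumes the remaining substrate (or remaining intact bonds) has been measured in the same arbitrary units for a treated sample and for a control not contacted with the enzyme. How that quantity is measured is not specified here, and the function names and numbers are hypothetical.

```python
def percent_degraded(control_remaining: float, treated_remaining: float) -> float:
    """Percent of substrate degraded relative to a control not contacted with enzyme."""
    if control_remaining <= 0:
        raise ValueError("control measurement must be positive")
    return 100.0 * (control_remaining - treated_remaining) / control_remaining


def is_effective_amount(control_remaining: float, treated_remaining: float,
                        threshold_percent: float = 50.0) -> bool:
    """True if degradation meets the threshold (50% by definition, 80% preferred)."""
    return percent_degraded(control_remaining, treated_remaining) >= threshold_percent


# Hypothetical readings: 10.0 units in the control, 1.5 units left after treatment.
print(percent_degraded(10.0, 1.5))
print(is_effective_amount(10.0, 1.5), is_effective_amount(10.0, 1.5, 80.0))
```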

In order to facilitate understanding of the following examples certain frequently 
occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital letters 
and/or numbers. The starting plasmids herein are either commercially available, publicly 
available on an unrestricted basis, or can be constructed from available plasmids in accord 
with published procedures. In addition, equivalent plasmids to those described are known in 
the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme 
that acts only at certain sequences in the DNA. The various restriction enzymes used herein 
are commercially available and their reaction conditions, cofactors and other requirements 
were used as would be known to the ordinarily skilled artisan. For analytical purposes, 
typically 1 µg of plasmid or DNA fragment is used with about 2 units of enzyme in about 20 
µl of buffer solution. For the purpose of isolating DNA fragments for plasmid construction, 
typically 5 to 50 µg of DNA are digested with 20 to 250 units of enzyme in a larger volume. 
Appropriate buffers and substrate amounts for particular restriction enzymes are specified by 
the manufacturer. Incubation times of about 1 hour at 37°C are ordinarily used, but may vary 
in accordance with the supplier's instructions. After digestion the reaction is electrophoresed 
directly on a polyacrylamide gel to isolate the desired fragment. 



Size separation of the cleaved fragments is generally performed using 8 percent 
polyacrylamide gel described by Goeddel, D. et al., Nucleic Acids Res., 8:4057 (1980), for 
example. 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides may or may not have a 5' phosphate. Those that do not will not 
ligate to another oligonucleotide without adding a phosphate with an ATP in the presence of 
kinase. A synthetic oligonucleotide will ligate to a fragment that has not been 
dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two double 
stranded nucleic acid fragments (Maniatis, T., et al., Id., p. 146). Unless otherwise provided, 
ligation may be accomplished using known buffers and conditions with 10 units of T4 DNA 
ligase ("ligase") per 0.5 µg of approximately equimolar amounts of the DNA fragments to be 
ligated. 
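
The ligation condition above (10 units of ligase per 0.5 µg of approximately equimolar fragments) can be turned into a small worked calculation. The following is a minimal sketch under two labeled assumptions that are not stated in the text: that the mass of double-stranded DNA is proportional to its length in base pairs, so equimolar masses scale with fragment length, and that ligase is scaled linearly with total DNA mass. The fragment sizes and function names are hypothetical.

```python
def equimolar_insert_mass_ug(vector_ug: float, vector_bp: int, insert_bp: int) -> float:
    """Insert mass giving the same number of molecules as the vector.

    Assumption: dsDNA mass is proportional to length, so equal moles means
    mass_insert / mass_vector == insert_bp / vector_bp.
    """
    return vector_ug * insert_bp / vector_bp


def ligase_units(total_dna_ug: float, units_per_half_ug: float = 10.0) -> float:
    """Units of T4 DNA ligase at the stated 10 units per 0.5 ug (linear scaling assumed)."""
    return units_per_half_ug * total_dna_ug / 0.5


# Hypothetical fragments: a 3400 bp vector (0.4 ug) and an 850 bp insert.
insert_ug = equimolar_insert_mass_ug(0.4, 3400, 850)
print(insert_ug, ligase_units(0.4 + insert_ug))
```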

Unless otherwise stated, transformation was performed as described in the method of 
Sambrook, Fritsch and Maniatis, 1989. The following examples are intended to illustrate, but 
not to limit, the invention. While the procedures described in the examples are typical of those 
that can be used to carry out certain aspects of the invention, other procedures known to those 
skilled in the art can also be used. The following materials and methods were used in carrying 
out the experiments described in the examples. 

Example 1 

Bacterial Expression and Purification of Endoglucanase 

An AEPIIla genomic library was constructed in the Lambda gt11 cloning vector 
(Stratagene Cloning Systems). The library was screened in Y1090 E. coli cells (Stratagene) for 
endoglucanase activity and a positive clone was identified and isolated. DNA of this clone was 
used as a template in a 100 µl PCR reaction using the following primer sequences: 

5' primer: AATAGCGGCCGCAAGCTTATCGACGGTTTCCATATGGGGATTGGTG (SEQ ID NO:49) and 

3' primer: AATAGCGGCCGCGGATCCAGACCAACTGGTAATGGTAGCGAC (SEQ ID NO:50). 

The PCR reaction product was purified and digested with Not I restriction enzyme. The 
digested product was subcloned into the pBluescript II SK cloning vector (Stratagene) and 
sequenced. The sequence information was used in the generation of primer sequences which 
were subsequently used to PCR amplify the target gene encoding the endoglucanase. The primer 
sequences used were as follows: 

5' primer: TTTATTCAATTGATTAAAGAGGAGAAATTAACTATGATAAACGTTGCAACGGGAGAGGAG 
(SEQ ID NO:51) and 

3' primer: TTTATTGGATCCTACTTTGTGTCAACGAAGTATCC (SEQ ID NO:52). 
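
The four primers of this Example carry the recognition sites of the enzymes used to digest their amplification products. The following is a minimal sketch that locates those sites by plain string search; the primer sequences are copied from the text with line-wrapped sequences joined, the recognition sequences for Not I, BamHI and MfeI are the standard published ones, and the dictionary and function names are illustrative. Positions are 0-based offsets from the 5' end of each primer.

```python
# Recognition sequences of the enzymes named in Example 1 (all palindromic).
SITES = {
    "NotI": "GCGGCCGC",
    "BamHI": "GGATCC",
    "MfeI": "CAATTG",
}

# Primer sequences as given in the text (wrapped lines joined).
PRIMERS = {
    "SEQ ID NO:49": "AATAGCGGCCGCAAGCTTATCGACGGTTTCCATATGGGGATTGGTG",
    "SEQ ID NO:50": "AATAGCGGCCGCGGATCCAGACCAACTGGTAATGGTAGCGAC",
    "SEQ ID NO:51": "TTTATTCAATTGATTAAAGAGGAGAAATTAACTATGATAAACGTTGCAACGGGAGAGGAG",
    "SEQ ID NO:52": "TTTATTGGATCCTACTTTGTGTCAACGAAGTATCC",
}


def find_sites(sequence: str) -> dict:
    """Map enzyme name to the 0-based position of its first site in the sequence."""
    hits = {}
    for enzyme, site in SITES.items():
        position = sequence.upper().find(site)
        if position != -1:
            hits[enzyme] = position
    return hits


for name, primer in PRIMERS.items():
    print(name, find_sites(primer))
```

The search shows a Not I site in both primers of the first pair, and an MfeI site and a BamHI site in the 5' and 3' primers of the second pair, respectively, which matches the digests described for each amplification product.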

The amplification product was digested with the restriction enzymes MfeI and BamHI. 
The digested product was then ligated to pQET cloning vector, a modified form of a pQE vector 
(Qiagen, Inc.) which was previously digested with BamHI and EcoRI (EcoRI ends being compatible 
with MfeI ends). The pQE vector encodes antibiotic resistance (Ampr), a bacterial origin of 
replication (ori), an IPTG-regulatable promoter operator (P/O), a ribosome binding site (RBS), 
a 6-His tag and restriction enzyme sites. 
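
The compatibility of MfeI-cut and EcoRI-cut ends follows from their cohesive overhangs, which can be checked directly. The following is a minimal sketch using the standard published cut positions (MfeI C^AATTG, EcoRI G^AATTC, BamHI G^GATCC); it handles only palindromic sites that leave a 5' overhang, and the function names are illustrative.

```python
# Recognition site and number of bases before the top-strand cut.
ENZYMES = {
    "MfeI": ("CAATTG", 1),
    "EcoRI": ("GAATTC", 1),
    "BamHI": ("GGATCC", 1),
}


def five_prime_overhang(enzyme: str) -> str:
    """Single-stranded 5' overhang left by a palindromic cutter."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def ends_compatible(enzyme_a: str, enzyme_b: str) -> bool:
    """Ends can anneal if both enzymes leave the same overhang."""
    return five_prime_overhang(enzyme_a) == five_prime_overhang(enzyme_b)


def hybrid_junction(left_enzyme: str, right_enzyme: str) -> str:
    """Top-strand sequence at the junction of a left end and a compatible right end."""
    if not ends_compatible(left_enzyme, right_enzyme):
        raise ValueError("overhangs do not match")
    left_site, left_cut = ENZYMES[left_enzyme]
    right_site, right_cut = ENZYMES[right_enzyme]
    return (left_site[:left_cut] + five_prime_overhang(left_enzyme)
            + right_site[len(right_site) - right_cut:])


print(five_prime_overhang("MfeI"), five_prime_overhang("EcoRI"))
print(ends_compatible("MfeI", "BamHI"))
print(hybrid_junction("EcoRI", "MfeI"))
```

Both enzymes leave an AATT overhang, so the ends anneal, whereas the BamHI overhang (GATC) differs, which fixes the orientation of the insert. The joined EcoRI/MfeI junction reads GAATTG, which is neither recognition site.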

The amplified sequences were inserted in frame with the sequence encoding for the RBS. 
The ligation mixture was then used to transform the E. coli strain M15/pREP4 (Qiagen, Inc.) by 
electroporation. M15/pREP4 contains multiple copies of the plasmid pREP4, which expresses 
the lacI repressor and also confers kanamycin resistance (Kanr). Positive recombinant 
transformants were identified as having thermostable CMCase/endoglucanase activity by the assay 
described above. Plasmid DNA was isolated and confirmed by restriction analysis. Clones 
containing the desired constructs were grown overnight (O/N) in liquid culture in LB media 
supplemented with both Amp (100 µg/ml) and Kan (25 µg/ml). The O/N culture was used to 
inoculate a large culture at a ratio of 1:100 to 1:250. The cells were grown to an optical density 
600 (O.D.600) of between 0.4 and 0.6. IPTG ("Isopropyl-β-D-thiogalactopyranoside") was then 
added to a final concentration of 1 mM. IPTG induces by inactivating the lacI repressor, clearing 
the P/O leading to increased gene expression. Cells were grown an extra 3 to 4 hours. Cells 
were then harvested by centrifugation. 
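
The volumes implied by the inoculation ratio and the IPTG concentration can be worked out as in the sketch below. The 1 litre culture volume and the 1 M IPTG stock are assumptions chosen for illustration and are not stated in the text. A ratio of 1:N is read here as one volume of overnight culture per N volumes of fresh medium.

```python
def inoculum_volume_ml(culture_ml, ratio):
    """Overnight-culture volume (ml) for a 1:ratio inoculation of culture_ml."""
    return culture_ml / ratio


def stock_volume_ul(culture_ml, final_mM, stock_mM):
    """Stock volume (microlitres) from C1*V1 = C2*V2, ignoring the added volume."""
    return culture_ml * 1000.0 * final_mM / stock_mM


if __name__ == "__main__":
    culture_ml = 1000.0  # hypothetical 1 litre expression culture
    for ratio in (100, 250):
        print(f"1:{ratio} inoculum: {inoculum_volume_ml(culture_ml, ratio)} ml")
    # Hypothetical 1 M (1000 mM) IPTG stock diluted to 1 mM final.
    print("IPTG stock:", stock_volume_ul(culture_ml, 1.0, 1000.0), "ul")
```

Under these assumptions the inoculum is between 4 ml and 10 ml of overnight culture, and 1 ml of stock gives the 1 mM final IPTG concentration.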

The primer sequences set out above may also be employed to isolate the target gene from 
the deposited material by hybridization techniques described above. 

Numerous modifications and variations of the present invention are possible in light of 
the above teachings and, therefore, within the scope of the appended claims, the invention may 
be practiced otherwise than as particularly described. It is to be understood that, while the 
invention has been described with reference to the above detailed description, the foregoing 
description is intended to illustrate, but not to limit, the scope of the invention. Other aspects, 
advantages, and modifications of the invention are within the scope of the following claims. All 
publications, patent applications, patents, and other references mentioned herein are incorporated 
by reference in their entirety. 


WO 97/44361 PCT/US97/08793 

What Is Claimed Is: 


1. Substantially pure endoglucanase having an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 
38, 40, 42, 44, 46, and 48. 

2. An isolated polynucleotide sequence encoding an endoglucanase of claim 1. 

3. An isolated polynucleotide selected from the group consisting of: 

a) SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
39, 41, 43, 45, and 47; 

b) SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
39, 41, 43, 45, and 47, wherein T can also be U; 

c) nucleic acid sequences complementary to a) and b); and 

d) fragments of a), b), or c) that are at least 15 bases in length and that will 
selectively hybridize to DNA which encodes the amino acid sequences of SEQ ID 
NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 
46, and 48. 

4. The polynucleotide of claim 2, wherein the polynucleotide is isolated from a prokaryote. 

5. An expression vector including the polynucleotide of claim 2. 

6. The vector of claim 5, wherein the vector is a plasmid. 

7. The vector of claim 5, wherein the vector is virus-derived. 

8. A host cell transformed with the vector of claim 5. 

9. The host cell of claim 8, wherein the cell is prokaryotic. 

10. Antibodies that bind to the polypeptide of claim 1. 

11. The antibodies of claim 10, wherein the antibodies are polyclonal. 

12. The antibodies of claim 10, wherein the antibodies are monoclonal. 

13. An enzyme comprising a member selected from the group consisting of: 

a) an enzyme comprising an amino acid sequence which is at least 70% identical to 
the amino acid sequence set forth in SEQ ID NO:2; and 

b) an enzyme which comprises at least 30 amino acid residues to the enzyme of a). 

14. A method for producing an enzyme comprising growing a host cell of claim 8 under 
conditions which allow the expression of the nucleic acid and isolating the enzyme 
encoded by the nucleic acid. 

15. A method for degrading carboxymethylcellulose comprising contacting 
carboxymethylcellulose with a degrading effective amount of the enzyme of claim 1. 

16. A method for hydrolyzing the beta 1,4 glycosidic bond in cellulose comprising contacting 
an effective amount of the enzyme of claim 1 with cellulose to hydrolyze the glycosidic 
bond. 

FIGURE 1A 

(SEQ ID NO:1-nucleotide sequence and SEQ ID NO:2-amino acid sequence) 
AEFIIla Archaeal Endoglucanase Sequence 



9 18 27 36 45 54 

5' ATG ATA AAC GTT GCA ACG GGA GAG GAG ACC CCA ATA CAC CTC TTT GGA GTC AAC 



Met Ile Asn Val Ala Thr Gly Glu Glu Thr Pro Ile His Leu Phe Gly Val Asn 

63 72 81 90 99 108 

TGG TTC GGC TTT GAG ACA CCG AAC TAC GTT GTT CAC GGC CTA TGG AGT AGG AAC 



Trp Phe Gly Phe Glu Thr Pro Asn Tyr Val Val His Gly Leu Trp Ser Arg Asn 

117 126 135 144 153 162 

TGG GAG GAC ATG CTC CTC CAG ATC AAG AGC CTT GGC TTC AAT GCG ATA AGG CTT 



Trp Glu Asp Met Leu Leu Gln Ile Lys Ser Leu Gly Phe Asn Ala Ile Arg Leu 

171 180 189 198 207 216 

CCC TTC TGT ACC CAG TCA GTA AAA CCG GGG ACG ATG CCA ACG GCG ATT GAC TAC 



Pro Phe Cys Thr Gln Ser Val Lys Pro Gly Thr Met Pro Thr Ala Ile Asp Tyr 

225 234 243 252 261 270 

GCC AAG AAC CCA GAC CTC CAG GGT CTT GAC AGC GTC CAG ATA ATG GAG AAA ATA 



Ala Lys Asn Pro Asp Leu Gln Gly Leu Asp Ser Val Gln Ile Met Glu Lys Ile 

279 288 297 306 315 324 

ATC AAG AAG GCT GGA GAC CTG GGC ATA TTC GTG CTC CTC GAC TAC CAC AGA ATA 



Ile Lys Lys Ala Gly Asp Leu Gly Ile Phe Val Leu Leu Asp Tyr His Arg Ile 






333 342 351 360 369 378 

GGA TGC AAC TTC ATA GAA CCC CTA TGG TAC ACC GAC AGC TTC TCG GAG CAG GAC 



Gly Cys Asn Phe Ile Glu Pro Leu Trp Tyr Thr Asp Ser Phe Ser Glu Gln Asp 

387 396 405 414 423 432 

TAC ATA AAC ACC TGG GTT GAA GTC GCC CAG AGG TTC GGC AAG TAC TGG AAC GTT 



Tyr Ile Asn Thr Trp Val Glu Val Ala Gln Arg Phe Gly Lys Tyr Trp Asn Val 

441 450 459 468 477 486 

ATC GGC GCG GAC CTG AAG AAC GAA CCC CAC AGC TCA AGC CCC GCA CCT GCC GCC 



Ile Gly Ala Asp Leu Lys Asn Glu Pro His Ser Ser Ser Pro Ala Pro Ala Ala 

495 504 513 522 531 540 

TAC ACT GAC GGA AGT GGG GCC ACG TGG GGA ATG GGC AAC AAC GCC ACC GAC TGG 



Tyr Thr Asp Gly Ser Gly Ala Thr Trp Gly Met Gly Asn Asn Ala Thr Asp Trp 

549 558 567 576 585 594 

AAC CTG GCG GCT GAG AGG ATA GGA AGG GCA ATT CTG GAG GTT GCC CCA CAA TGG 



Asn Leu Ala Ala Glu Arg Ile Gly Arg Ala Ile Leu Glu Val Ala Pro Gln Trp 

603 612 621 630 639 648 

GTT ATA TTT GTT GAG GGA ACC CAG TTC ACC ACC CCC GAG ATA GAC GGT AGG TAC 



Val Ile Phe Val Glu Gly Thr Gln Phe Thr Thr Pro Glu Ile Asp Gly Arg Tyr 

657 666 675 684 693 702 

AAG TGG GGC CAC AAC GCC TGG TGG GGC GGA AAC CTT ATG GGT GTT AGG AAG TAC 



Lys Trp Gly His Asn Ala Trp Trp Gly Gly Asn Leu Met Gly Val Arg Lys Tyr 

711 720 729 738 747 756 

CCA GTT AAC CTG CCC AGG GAC AAG GTT GTT TAC AGC CCC CAA GTT TAC GGT TCA 



Pro Val Asn Leu Pro Arg Asp Lys Val Val Tyr Ser Pro Gln Val Tyr Gly Ser 

765 774 783 792 801 810 

GAA GTT TAC GAC CAG CCC TAC TTT GAC CCC GGT GAG GGG TTC CCC GAC AAC CTC 



Glu Val Tyr Asp Gln Pro Tyr Phe Asp Pro Gly Glu Gly Phe Pro Asp Asn Leu 

819 828 837 846 855 864 

CCC GAA ATA TGG TAC CAC CAC TTC GGC TAC GTA AAG CTT GAT CTC GGT TAC CCT 



Pro Glu Ile Trp Tyr His His Phe Gly Tyr Val Lys Leu Asp Leu Gly Tyr Pro 

873 882 891 900 909 918 

GTT GTT ATA GGT GAG TTC GGA GGC AAG TAC GGC CAT GGG GGA GAC CCG AGG GAT 



Val Val Ile Gly Glu Phe Gly Gly Lys Tyr Gly His Gly Gly Asp Pro Arg Asp 

927 936 945 954 963 972 

GTC ACT TGG CAG AAC AAG ATA ATA GAC TGG ATG ATC CAG AAC AAA TTC TGT GAC 



Val Thr Trp Gln Asn Lys Ile Ile Asp Trp Met Ile Gln Asn Lys Phe Cys Asp 

981 990 999 1008 1017 1026 

TTC TTC TAC TGG AGC TGG AAC CCA AAC AGC GGT GAC ACC GGT GGA ATT CTG AAG 



Phe Phe Tyr Trp Ser Trp Asn Pro Asn Ser Gly Asp Thr Gly Gly Ile Leu Lys 

1035 1044 1053 1062 1071 1080 

GAT GAC TGG ACG ACA ATA TGG GAG GAC AAG TAC AAC AAC CTG AAG AGG CTC ATG 



Asp Asp Trp Thr Thr Ile Trp Glu Asp Lys Tyr Asn Asn Leu Lys Arg Leu Met 

1089 1098 1107 1116 1125 1134 

GAC AGC TGT TCT GGA AAC GCC ACT GCC CCG TCC GTC CCC ACG ACA ACT ACA ACA 



Asp Ser Cys Ser Gly Asn Ala Thr Ala Pro Ser Val Pro Thr Thr Thr Thr Thr 

1143 1152 1161 1170 1179 1188 

ACA AGC ACA CCG CCA ACG ACC ACA ACG ACT ACA ACA TCC ACT CCA ACG ACC ACT 



Thr Ser Thr Pro Pro Thr Thr Thr Thr Thr Thr Thr Ser Thr Pro Thr Thr Thr 

1197 1206 1215 1224 1233 1242 

ACC CAG ACC CCG ACC ACC ACT ACT CCA ACT ACG ACA ACC ACC ACG ACC ACA ACT 



Thr Gln Thr Pro Thr Thr Thr Thr Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr 
1251 1260 1269 1278 1287 1296 

CCT TCA AAT AAC GTC CCA TTT GAA ATT GTG AAC GTT CTC CCG ACT AGC TCC CAG 



Pro Ser Asn Asn Val Pro Phe Glu Ile Val Asn Val Leu Pro Thr Ser Ser Gln 

1305 1314 1323 1332 1341 1350 

TAC GAG GGA ACC AGC GTG GAG GTT GTA TGT GAT GGA ACC CAG TGT GCC TCC AGC 



Tyr Glu Gly Thr Ser Val Glu Val Val Cys Asp Gly Thr Gln Cys Ala Ser Ser 

1359 1368 1377 1386 1395 1404 

GTT TGG GGA GCT CCG AAC CTC TGG GGA GTC GTT AAA ATC GGA AAC GCC ACC ATG 



Val Trp Gly Ala Pro Asn Leu Trp Gly Val Val Lys Ile Gly Asn Ala Thr Met 

1413 1422 1431 1440 1449 1458 

GAC CCC AAC GTT TGG GGC TGG GAG GAC GTT TAC AAG ACT GCA CCC CAG GAC ATT 



Asp Pro Asn Val Trp Gly Trp Glu Asp Val Tyr Lys Thr Ala Pro Gln Asp Ile 
1467 1476 1485 1494 1503 1512 

GGA ACC GGC AGC ACA AAG ATG GAG ATA AGG AAC GGG GTG CTC AAG GTT ACA AAC 

Gly Thr Gly Ser Thr Lys Met Glu Ile Arg Asn Gly Val Leu Lys Val Thr Asn 

1521 1530 1539 1548 1557 1566 

CTC TGG AAC ATC AAC ATG CAT CCG AAG TAT AAC ACA ATG GCA TAC CCG GAG GTC 



Leu Trp Asn Ile Asn Met His Pro Lys Tyr Asn Thr Met Ala Tyr Pro Glu Val 

1575 1584 1593 1602 1611 1620 

ATA TAC GGC GCC AAG CCT TGG GGC AAC CAG CCA ATA AAC GCT CCG AAC TTC GTG 



Ile Tyr Gly Ala Lys Pro Trp Gly Asn Gln Pro Ile Asn Ala Pro Asn Phe Val 

1629 1638 1647 1656 1665 1674 

CTC CCG ATA AAG GTC TCC CAG CTT CCG AGG ATA CTC GTT GAC ACA AAG TAC ACG 



Leu Pro Ile Lys Val Ser Gln Leu Pro Arg Ile Leu Val Asp Thr Lys Tyr Thr 
1683 1692 1701 1710 1719 1728 

CTC GAA AAG AGC TTC CCG GGA AAC AAC TTC GCC TTT GAG GCC TGG CTC TTC AAG 



Leu Glu Lys Ser Phe Pro Gly Asn Asn Phe Ala Phe Glu Ala Trp Leu Phe Lys 

1737 1746 1755 1764 1773 1782 

GAT GCC AAC AAC ATG AGG GCA CCA GGC CAG GGG GAC TAC GAG ATA ATG GTA CAG 



Asp Ala Asn Asn Met Arg Ala Pro Gly Gln Gly Asp Tyr Glu Ile Met Val Gln 

1791 1800 1809 1818 1827 1836 

CTC TAC ATC GAG GGC GGC TAT CCT GCG GGC TAC GAC AAG GGG CCA GTT CTC ACC 



Leu Tyr Ile Glu Gly Gly Tyr Pro Ala Gly Tyr Asp Lys Gly Pro Val Leu Thr 



1845 1854 1863 1872 1881 1890 

GTT GAT GTT CCG ATA ATC GTC GAT GGA AGG CTT GTA AAC CAG ACT TTT GAG CTC 

Val Asp Val Pro Ile Ile Val Asp Gly Arg Leu Val Asn Gln Thr Phe Glu Leu 

1899 1908 1917 1926 1935 1944 

TAC GAC GTC ATA GCG GAT GCC GGA TGG AGG TTC TTC ACC TTC AAG CCA ACT AAG 



Tyr Asp Val Ile Ala Asp Ala Gly Trp Arg Phe Phe Thr Phe Lys Pro Thr Lys 

1953 1962 1971 1980 1989 1998 

AAC TAC AAC GGC TCA GAG GTT GTG TTC GAC TAC ACC AAA TTC ATA GAA ATA GTT 



Asn Tyr Asn Gly Ser Glu Val Val Phe Asp Tyr Thr Lys Phe Ile Glu Ile Val 

2007 2016 2025 2034 2043 2052 

GAC AAC TAC CTC GGC GGT GGC AGC CTC ACG AAC CAC TAC CTG ATG TCC CTG GAA 



Asp Asn Tyr Leu Gly Gly Gly Ser Leu Thr Asn His Tyr Leu Met Ser Leu Glu 

2061 2070 2079 2088 2097 2106 

TTC GGT ACC GAG ATA TAC ACC AAC GGG TGC ACC TCA TTC CCA TGC ACA GTG GAC 



Phe Gly Thr Glu Ile Tyr Thr Asn Gly Cys Thr Ser Phe Pro Cys Thr Val Asp 
2115 2124 2133 2142 2151 2160 

GTA AGG TGG ACC CTT GAC AAG TAC AGG TTC ATC CTG GCC CCA GGA ACA ATG GCC 



Val Arg Trp Thr Leu Asp Lys Tyr Arg Phe Ile Leu Ala Pro Gly Thr Met Ala 

2169 2178 2187 2196 2205 2214 

ACT GAG GAG GCC ATG AGA GTT CTC GTC GGA GAG GTC CAG CCT CCC GCT TCC ACA 



Thr Glu Glu Ala Met Arg Val Leu Val Gly Glu Val Gln Pro Pro Ala Ser Thr 

2223 2232 2241 2250 2259 2268 

ACA ACA TCG CAG ACG ACT ACT TCA ACC ACA ACC CCA ACG CCC ACT ACC ACT ACT 

Thr Thr Ser Gln Thr Thr Thr Ser Thr Thr Thr Pro Thr Pro Thr Thr Thr Thr 

2277 2286 2295 2304 2313 2322 

ACG ACT CAG ACT TCA ACC ACC ACT ACA ACC ACC TCA CCG CCG ACA ACC ACC GCA 



Thr Thr Gln Thr Ser Thr Thr Thr Thr Thr Thr Ser Pro Pro Thr Thr Thr Ala 

2331 2340 2349 2358 2367 2376 

CCT GCT CAG GAC GTA ATT AAG CTC AGG TAC CCG GAC GAT GGG CAG TGG CCC GAG 



Pro Ala Gln Asp Val Ile Lys Leu Arg Tyr Pro Asp Asp Gly Gln Trp Pro Glu 

2385 2394 2403 2412 2421 2430 

GCC CCA ATT GAC AGG GAT GGA GAC GGA AAC CCA GAG TTC TAC ATA GAA ATA AAC 



Ala Pro Ile Asp Arg Asp Gly Asp Gly Asn Pro Glu Phe Tyr Ile Glu Ile Asn 

2439 2448 2457 2466 2475 2484 

CCG TGG AAC ATA CTG AGC GCT GAA AGC TAC GCC GAG ATG ACC TAC AAC TTG AGC 



Pro Trp Asn Ile Leu Ser Ala Glu Ser Tyr Ala Glu Met Thr Tyr Asn Leu Ser 

2493 2502 2511 2520 2529 

AGC GGG GTT CTC CAC TAC GTC CAG GCC CTG GAT AGT ATA TGA TGA 3' 



Ser Gly Val Leu His Tyr Val Gln Ala Leu Asp Ser Ile *** *** 
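
The pairing of codon rows and amino acid rows in the figure can be cross-checked with a short translation sketch. It assumes the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative and do not come from the original document. Only the first and last codon rows of the figure are used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# One-letter to three-letter names; stop codons are shown as *** as in the figure.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}


def translate(codon_row):
    """Translate a space-separated row of codons into three-letter residues."""
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codon_row.split())


# First and last codon rows of the figure.
FIRST_ROW = "ATG ATA AAC GTT GCA ACG GGA GAG GAG ACC CCA ATA CAC CTC TTT GGA GTC AAC"
LAST_ROW = "AGC GGG GTT CTC CAC TAC GTC CAG GCC CTG GAT AGT ATA TGA TGA"

if __name__ == "__main__":
    print(translate(FIRST_ROW))
    print(translate(LAST_ROW))
```

The same function can be applied to any other codon row to confirm a residue where the printed abbreviation is in doubt, for example ATA, ATC and ATT as Ile, or CAA and CAG as Gln.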

0C9a (clone # 27GA1) Glycosidase 

1 

ATG CCA ACC AAT GTA TTT TTC AAC GCC CAT CAC TCG CCG GTT GGG GCG TTT 
Met Pro Thr Asn Val Phe Phe Asn Ala His His Ser Pro Val Gly Ala Phe 

GCC AGC TTT ACG CTA GGG TTT CCG GGA AAA AGC GGA GGA CTG GAC TTG GAA 
Ala Ser Phe Thr Leu Gly Phe Pro Gly Lys Ser Gly Gly Leu Asp Leu Glu 

CTT GCC CGA CCG CCA CGG CAA AAT GTC TTT ATT GGC GTT GAG TCG CCG CAT 
Leu Ala Arg Pro Pro Arg Gln Asn Val Phe Ile Gly Val Glu Ser Pro His 

GAG CCG GGG CTG TAT CAT ATC CTT CCA TTC GCG GAA ACA GCA GGC GAG GAT 
Glu Pro Gly Leu Tyr His Ile Leu Pro Phe Ala Glu Thr Ala Gly Glu Asp 

GAA AGC AAA CGA TAT GAC ATT GAA AAT CCT GAT CCG AAT CCG CAA AAA CCA 
Glu Ser Lys Arg Tyr Asp Ile Glu Asn Pro Asp Pro Asn Pro Gln Lys Pro 

AAC ATC CTG ATT CCA TTT GCG AAA GAG CGG ATC GAA CGC GAA TTT CGC GTT 
Asn Ile Leu Ile Pro Phe Ala Lys Glu Arg Ile Glu Arg Glu Phe Arg Val 

GCC ACG GAT ACA TGG AAG GCC GGG GAC TTG ACG TTG ACG ATT TAT TCA CCG 
Ala Thr Asp Thr Trp Lys Ala Gly Asp Leu Thr Leu Thr Ile Tyr Ser Pro 

GTG AAG GCC GTA CCA GAT CCG GAA 
Val Lys Ala Val Pro Asp Pro Glu 

GCG TTG GTT CCA GCT GTC ATT GTC 
Ala Leu Val Pro Ala Val Ile Val 

ACA AGA ACA CGA CGG GCG TTT TTC 
Thr Arg Thr Arg Arg Ala Phe Phe 

TCG ATG CGG GGG ATC GAT GAT ACA 
Ser Met Arg Gly Ile Asp Asp Thr 

GGG CGG ATT TTG GGC ATA GCA TCC 
Gly Arg Ile Leu Gly Ile Ala Ser 

CAT TTT AGC ATG GAG GAT ATC TTA 
His Phe Ser Met Glu Asp Ile Leu 

TTT GGG CTC GGG AAA GTC GGT GCA 
Phe Gly Leu Gly Lys Val Gly Ala 

AAG AAA ACG TAT CAA TTT GCT GTT 
Lys Lys Thr Tyr Gln Phe Ala Val 

ACG GCC TCC GAG GAA GAA CTC AAG TTG 
Thr Ala Ser Glu Glu Glu Leu Lys Leu 

GAG ATG ACG ATC GAT AAT ACG AAC GGA 
Glu Met Thr Ile Asp Asn Thr Asn Gly 

GGA TTC GAA GGC ACT GAC CCG TAT ACC 
Gly Phe Glu Gly Thr Asp Pro Tyr Thr 

TGC CCG CAG CTG CGC GGT GTC GGT CAA 
Cys Pro Gln Leu Arg Gly Val Gly Gln 

AAG GAT GAG GGC GTT CGT TCA GCA CTG 
Lys Asp Glu Gly Val Arg Ser Ala Leu 

ACG GCG ACT CTC GAA GAA AAC TGG ACG 
Thr Ala Thr Leu Glu Glu Asn Trp Thr 

TTA ATT GCG GAT GTG CCG GCG GGA GAA 
Leu Ile Ala Asp Val Pro Ala Gly Glu 

TGC TTC TAT CGT GGG GGT TGT GTG ACG 
Cys Phe Tyr Arg Gly Gly Cys Val Thr 

GCG GGA ATG GAT GCC TCT TAT TTT 
Ala Gly Met Asp Ala Ser Tyr Phe 

GAA GTC GGT CTT TAT GCG TTA GAG 
Glu Val Gly Leu Tyr Ala Leu Glu 

TTC CGT TCG AAT GAA CTC ATT GAA 
Phe Arg Ser Asn Glu Leu Ile Glu 

TTT ATG ATG GCG CAC GCG ATC CGT 
Phe Met Met Ala His Ala Ile Arg 

GAG CAT GAA GGA AAG CCG ATT TGG 
Glu His Glu Gly Lys Pro Ile Trp 

ATG AAT ACG TTT GAT CTC ACC GTC 
Met Asn Thr Phe Asp Leu Thr Val 

AAT CCG TGG ACG GTG AAA AAT GTG 
Asn Pro Trp Thr Val Lys Asn Val 

TAT GAG GAT CGT GTC CGT TTC CCA 



TAC ACC CGT TTC TTC CAT AAT ATC GAA 
Tyr Thr Arg Phe Phe His Asn Ile Glu 

CAG GCC GAG GTG TTA AAA GAG CAG GCG 
Gln Ala Glu Val Leu Lys Glu Gln Ala 

AAA GAA TGG CTC TCC GAT GAT CAA AAG 
Lys Glu Trp Leu Ser Asp Asp Gln Lys 

AGC TAC TAT GGC AAT ACA CAG CTG CTT 
Ser Tyr Tyr Gly Asn Thr Gln Leu Leu 

GTC GTC AAT GAA GGC GAG TAC CGG ATG 
Val Val Asn Glu Gly Glu Tyr Arg Met 

GAC CAG CTC TTT TTT GAA TTG AAA ATG 
Asp Gln Leu Phe Phe Glu Leu Lys Met 

CTT GAC TTT TAT GTC GAG CGC TAC AGC 
Leu Asp Phe Tyr Val Glu Arg Tyr Ser 

GGA GAT GAG ACG GAA TAC CCC GGC GGC 

Tyr Glu Asp Arg Val Arg Phe Pro 

ATC AGC TTC ACT CAC GAT ATG GGA 
Ile Ser Phe Thr His Asp Met Gly 

TAC TCG TCA TAT GAG CTA TAC GGG 
Tyr Ser Ser Tyr Glu Leu Tyr Gly 

CAC GAA CAG CTC GTC AAC TGG GTG 
His Glu Gln Leu Val Asn Trp Val 

ACG AAA GAC TGG GCA TGG CGC GAC 
Thr Lys Asp Trp Ala Trp Arg Asp 

CTC GAA AGC ATG GTG CGC CGC GAT 
Leu Glu Ser Met Val Arg Arg Asp 

GTG ATG GGG CTT GAC AGC ACC CGC 
Val Met Gly Leu Asp Ser Thr Arg 

TAT GAT AGT TTG GAT GTT TCT CTT 
Tyr Asp Ser Leu Asp Val Ser Leu 

Gly Asp Glu Thr Glu Tyr Pro Gly Gly 

GTC GCC AAC ACG TTC TCA CGC CCG CAT 
Val Ala Asn Thr Phe Ser Arg Pro His 

ATC AGC GGC TGC TTT TCA CAT ATG ACG 
Ile Ser Gly Cys Phe Ser His Met Thr 

CTT TGC GCA GCG GTA TAC ATC GAA CAA 
Leu Cys Ala Ala Val Tyr Ile Glu Gln 

CGG CGG CTT ACG ATC TTG GAA CAA TGT 
Arg Arg Leu Thr Ile Leu Glu Gln Cys 

CAT CCG GAT CCA GAA AAG CGG AAC GGC 
His Pro Asp Pro Glu Lys Arg Asn Gly 

ACG ATG GGT GGA GCG GAA ATC ACA ACG 
Thr Met Gly Gly Ala Glu Ile Thr Thr 

GGC CAG GCG CGC AAC AAT TTA TAT TTG 
Gly Gln Ala Arg Asn Asn Leu Tyr Leu 

GCA GGA AAA TGT TGG GCT GCC TAT 
Ala Gly Lys Cys Trp Ala Ala Tyr 

GTC GGC AAA GAA GAA CTG GCT GCA 
Val Gly Lys Glu Glu Leu Ala Ala 

GCC GCG ACG ATT GTC AGT CAC GTG 
Ala Ala Thr Ile Val Ser His Val 

ATG GGA GAA GGA AAT GAC TCG AAA 
Met Gly Glu Gly Asn Asp Ser Lys 

TTT CCT TAC TTT ACG AAC TGC CAT 
Phe Pro Tyr Phe Thr Asn Cys His 

GGA GAC TAT ATT CGT GCA CTG CGA 
Gly Asp Tyr Ile Arg Ala Leu Arg 

GGA ATT TAC CTA TTC CCG GAC GGG 
Gly Ile Tyr Leu Phe Pro Asp Gly 

CAA CTC GTG GTT GAG CAA AAT TTA 
Gln Leu Val Val Glu Gln Asn Leu 

GTG GCG CTC GAA AAG TTG TTC CGC GAT 
Val Ala Leu Glu Lys Leu Phe Arg Asp 

TTG GCA AGG GAG CAG GCG GAA AAA TGC 
Leu Ala Arg Glu Gln Ala Glu Lys Cys 

ACG GAG GAC GGG TAT ATC CCA GCC GTG 
Thr Glu Asp Gly Tyr Ile Pro Ala Val 

ATC ATT CCG GCT ATT GAG GGG CTT GTG 
Ile Ile Pro Ala Ile Glu Gly Leu Val 

GAG GCG TTA AGA GAA GAC GGA CGT TTT 
Glu Ala Leu Arg Glu Asp Gly Arg Phe 

CAA CAT TTG CAA TAT GTG TTG CGG GAA 
Gln His Leu Gln Tyr Val Leu Arg Glu 

GGA TGG AAA ATT TGC CTC GAC AAG CAA 
Gly Trp Lys Ile Cys Leu Asp Lys Gln 

CTT ATG CCA GTT TAT TGC CCG CCG CAT 
Leu Met Pro Val Tyr Cys Pro Pro His 

TTT AGG GTG GGA ATG GGA TGA 1958 
Phe Arg Val Gly Met Gly END 

Bankia gouldi mix (Clone # 37GP2) Glycosidase 

1 

ATG TTG AAA AAA CTG GCT TTA GCA GCC GGG ATC GCA GCA GCA ACA CTG GCT 
Met Leu Lys Lys Leu Ala Leu Ala Ala Gly Ile Ala Ala Ala Thr Leu Ala 

GCA TCC GGT TCC CAT GGG CAG ACG TTC GCG TAC GGC GAA GCT CTG CAA AAA 
Ala Ser Gly Ser His Gly Gln Thr Phe Ala Tyr Gly Glu Ala Leu Gln Lys 

TCC ATC TAT TTT TAT GAG GCT CAA CAG GCC GGC CCA CTC CCG GAA TGG AAC 
Ser Ile Tyr Phe Tyr Glu Ala Gln Gln Ala Gly Pro Leu Pro Glu Trp Asn 

CGC GTT GCC TGG CGT GGC GAC TCA GTT CCT GAT GAC GGT GCC GAC GTC GGA 
Arg Val Ala Trp Arg Gly Asp Ser Val Pro Asp Asp Gly Ala Asp Val Gly 

CTG GAT TTA CGC GGT GGC TGG TTC GAT GCG GGC GAC CAC GTT AAG TTT GGC 
Leu Asp Leu Arg Gly Gly Trp Phe Asp Ala Gly Asp His Val Lys Phe Gly 

TTT CCA ATG GCC GCG TCA GCG ACA CTC GTC GCC TGG GGA GGC GTC GAT TAC 
Phe Pro Met Ala Ala Ser Ala Thr Leu Val Ala Trp Gly Gly Val Asp Tyr 

AAA GAC GCG TAC GAA CAG TCG GGG CAA ATG GAA CAT CTG CGC AAC AAC CTG 
Lys Asp Ala Tyr Glu Gln Ser Gly Gln Met Glu His Leu Arg Asn Asn Leu 

CGC TTC GTC AAT GAC TAC TTT ATC 
Arg Phe Val Asn Asp Tyr Phe Ile 

TAC GGG CAG GTT GGC GAT GGC AGT 
Tyr Gly Gln Val Gly Asp Gly Ser 

GAG GTT CTG CAC CAC AAG ATC CCC 
Glu Val Leu His His Lys Ile Pro 

GAA AGC TGC CCG GGT ACC GAT CTG 
Glu Ser Cys Pro Gly Thr Asp Leu 

GCG TCT GCG ATG GTT TTT CAG GGT 
Ala Ser Ala Met Val Phe Gln Gly 

ATC ACT CAC GCC AAA CAG CTG TGG 
Ile Thr His Ala Lys Gln Leu Trp 

ACC GGT ACA GAT ACA GCC TAT TCC 
Thr Gly Thr Asp Thr Ala Tyr Ser 

TAT ACG TCG ACG TAT GGC GTT TAC 
Tyr Thr Ser Thr Tyr Gly Val Tyr 

AGC GCG CAC CCC GCT CCG AAC GTG CTT 
Ser Ala His Pro Ala Pro Asn Val Leu 

GCA GAC CAT ACC TTC TGG GGT CCC GCT 
Ala Asp His Thr Phe Trp Gly Pro Ala 

GGC TCG CGC ATT TCT ATG AAG ATT GAC 
Gly Ser Arg Ile Ser Met Lys Ile Asp 

GCC GCA GAG ACC GCA GCA GCG ATG GCC 
Ala Ala Glu Thr Ala Ala Ala Met Ala 

GAG GAC GAT GCT TAC GCA GCA ACC CTG 
Glu Asp Asp Ala Tyr Ala Ala Thr Leu 

CAA TTT GCT GAT TCA ACC AAA GGC ACA 
Gln Phe Ala Asp Ser Thr Lys Gly Thr 

AAT TGC ATA ACA GGT GCA CAG GGC TTT 
Asn Cys Ile Thr Gly Ala Gln Gly Phe 

TAC GAT GAA CTT GCC TGG GGT GCT CTC 
Tyr Asp Glu Leu Ala Trp Gly Ala Leu 

TGG TTA TGG CGC GCA 
Trp Leu Trp Arg Ala 

TAC TAC GGT TTG ATG 
Tyr Tyr Gly Leu Met 

TGG TCG CTT GGC TGG 
Trp Ser Leu Gly Trp 

GCA CTT GTA GGT GAC 
Ala Leu Val Gly Asp 

CAC TGG AGC GTC GGC 
His Trp Ser Val Gly 

GAC TCC TGG GGG GTA 
Asp Ser Trp Gly Val 

TTT TAT GCA GAT GCG 
Phe Tyr Ala Asp Ala 

AAT TTT GGT AAG AAG 

ACT GGA GAA GAC TTC TAC 
Thr Gly Glu Asp Phe Tyr 

GGC TTT GAA AAC CAG ACG 
Gly Phe Glu Asn Gln Thr 

AAC GAT AAA GCG TAT GCC 
Asn Asp Lys Ala Tyr Ala 

GAG GTT TAC CAC GCA GAT 
Glu Val Tyr His Ala Asp 

GAG GGT AAC CGC ACA CCC 
Glu Gly Asn Arg Thr Pro 

AAC CGC TAT GCG GCC AAC 
Asn Arg Tyr Ala Ala Asn 

ATT GGC AGT GAC CAC CCC 
Ile Gly Ser Asp His Pro 

CAG ATC GAT CAT ATC CTG 

CTG GAA CAA GCC AAG CAT 
Leu Glu Gln Ala Lys His 

ACA ACT CCG GTA TAT ACC 
Thr Thr Pro Val Tyr Thr 

GTT TAT GTA CTT ATG GCC 
Val Tyr Val Leu Met Ala 

GCA CAG CGC TAC CTG GAT 
Ala Gln Arg Tyr Leu Asp 

AAT GGG CTG ATT CTG GTC 
Asn Gly Leu Ile Leu Val 

GCG GGT TAT CTC GCA CTC 
Ala Gly Tyr Leu Ala Leu 

CTT TAT GAT CGT TAC CAC 
Leu Tyr Asp Arg Tyr His 

GGC GAC AAC CCT GAC AAC 

Asn Phe Gly Lys Lys Gln Ile Asp 

CAA AGC TAC GTC GTC GGC TTT GGC 
Gln Ser Tyr Val Val Gly Phe Gly 

CGT GGC TCC CAC GGT TCC TGG TCC 
Arg Gly Ser His Gly Ser Trp Ser 

CGC CAT GTG CTA TAC GGC GCA GTT 
Arg His Val Leu Tyr Gly Ala Val 

TAT GAA GAA GAC CGC AAT GAC TAT 
Tyr Glu Glu Asp Arg Asn Asp Tyr 

AAC TCA GGC TTC ACC AGT GCC GTC 
Asn Ser Gly Phe Thr Ser Ala Val 

GCG CCC CTG GCG AAC TTC CCG CCT 
Ala Pro Leu Ala Asn Phe Pro Pro 

GTG GGG GCC AAG ATC AAT TCC TCT 
Val Gly Ala Lys Ile Asn Ser Ser 

His Ile Leu Gly Asp Asn Pro Asp Asn 

GAT AAT TTC CCA ATC AAT GTT CAC CAC 
Asp Asn Phe Pro Ile Asn Val His His 

GAC AGC ATT TCC AAC CCG GTT AAT CAA 
Asp Ser Ile Ser Asn Pro Val Asn Gln 

GCC GGT GGT CCG CAG GGC GAT ACA GGC 
Ala Gly Gly Pro Gln Gly Asp Thr Gly 

GTG CAG AAT GAG GTC GCA ACA GAC TAC 
Val Gln Asn Glu Val Ala Thr Asp Tyr 

GCT GCA CTT TAT GAT CAC TAT GGT GGC 
Ala Ala Leu Tyr Asp His Tyr Gly Gly 

CCC GAA CCA GAG TCG GTG GAG TAT CTG 
Pro Glu Pro Glu Ser Val Glu Tyr Leu 

GGC AAC CGC TTC GTG GAA ATG AAA GCC 
Gly Asn Arg Phe Val Glu Met Lys Ala 

GTT ATT CAA AAC CAC AGC ACA ACA 
Val Ile Gln Asn His Ser Thr Thr 

ATG CGC TAT TTC TAT GAT CTG AGC 
Met Arg Tyr Phe Tyr Asp Leu Ser 

AAT GAT CTA ACG GTG GCG TCC GGA 
Asn Asp Leu Thr Val Ala Ser Gly 

CTG CAA CAT TGG GAT GGC AAC GTC 
Leu Gln His Trp Asp Gly Asn Val 

GAT GTG GTA TTT CCC GGT GGT CAG 
Asp Val Val Phe Pro Gly Gly Gln 

CGC GTG TCC CTG CCA ACC ACA TCC 

Arg Val Ser Leu Pro Thr Thr Ser 

GAC CCC TCG TTT GAT CCA AGT TAT 
Asp Pro Ser Phe Asp Pro Ser Tyr 

GGT ATC GAC GCG CCG AAA ATT CCA 
Gly Ile Asp Ala Pro Lys Ile Pro 

CCC GCC CAA GGT AAA GAC GAC CTT TAC 
Pro Ala Gln Gly Lys Asp Asp Leu Tyr 

GAA GTA TTT GCC GCA GGC TAC AGT TTG 
Glu Val Phe Ala Ala Gly Tyr Ser Leu 

TAC AAC CAA GCC TCG GAT GTG AAT GGC 
Tyr Asn Gln Ala Ser Asp Val Asn Gly 

TAC TAT GTG GAA GCC CAG TTC TAT GAC 
Tyr Tyr Val Glu Ala Gln Phe Tyr Asp 

TCC GCG CAC CGA CGG GAA GTA CAA TTT 
Ser Ala His Arg Arg Glu Val Gln Phe 

AAT CTT GCC GAG TGG GAC AAC ACG AAC 
Asn Leu Ala Glu Trp Asp Asn Thr Asn 

TTA ACG GTC GAT AGT AGT CTG ACT TAC 
Leu Thr Val Asp Ser Ser Leu Thr Tyr 

CTC TAC GAC GCC AAC GGC CTG CTC TGG 
Leu Tyr Asp Ala Asn Gly Leu Leu Trp 

GGC GAG GAG CCA CCC CGT GGC GGA 
Gly Glu Glu Pro Pro Arg Gly Gly 

TCG TCC TCT AGC TCA TCC AGC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser 

TCG TCC TCG AGT AAT TCG TCC TCT 
Ser Ser Ser Ser Asn Ser Ser Ser 

TCG TCG TCT AAC AGC AGT TCC TCG 
Ser Ser Ser Asn Ser Ser Ser Ser 

AGT TCG TCG AGT TCG GGC GGC ACC 
Ser Ser Ser Ser Ser Gly Gly Thr 

TGG ACC GCA CGT GAC TGG GCC GGT 
Trp Thr Ala Arg Asp Trp Ala Gly 

GAT TTG ATG GTT TAC CAA GGT ACT 
Asp Leu Met Val Tyr Gln Gly Thr 

AGT GTG CCT GGC AGT GAT GCA TCC 

ACT TCC TCC AGC TCA TCG TCG AGC AGT 
Thr Ser Ser Ser Ser Ser Ser Ser Ser 

TCA TCG TCG AGC AGC TCC TCG AGC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

AGC TCG TCC AGC TCT TCG TCG AAT TCT 
Ser Ser Ser Ser Ser Ser Ser Asn Ser 

TCC AGC TCA AGC TCA TCG AGC AGT TCC 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

TGT GCG GAC GTG AAC GTA TAC CCC AAC 
Cys Ala Asp Val Asn Val Tyr Pro Asn 

GGA GTA CCG AAC CAC GCG GAA GCC GGT 
Gly Val Pro Asn His Ala Glu Ala Gly 

GTC TAC CAA GCT AAT TGG TAC ACC AAC 
Val Tyr Gln Ala Asn Trp Tyr Thr Asn 

TGG ACC AAC CAA GGG TTA TGT GCC GGC 

Ser Val Pro Gly Ser Asp Ala Ser 

GGC GGA TCC AGC TCC AGC AGC TCA 
Gly Gly Ser Ser Ser Ser Ser Ser 

AGC AGC AGC TCA AGC TCG TCC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser 

AGC AGT TCG TCC TCG TCA AGT TCG 
Ser Ser Ser Ser Ser Ser Ser Ser 

GGT GGC GGC GCC ATG TGT AAC TGG 
Gly Gly Gly Ala Met Cys Asn Trp 

AAC ACC CCA TCT GGC TGG GGC AAC 
Asn Thr Pro Ser Gly Trp Gly Asn 

GAT ACT TGC CAA GAG GTC GTC AAC 
Asp Thr Cys Gln Glu Val Val Asn 

Trp Thr Asn Gln Gly Leu Cys Ala Gly 

TCA TCC AGC TCA AGC AGC TCT TCG TCC 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

GGT GCG TCC GGT TCA TCC TCC AGC TCG 
Gly Ala Ser Gly Ser Ser Ser Ser Ser 

AGC AGC AGC TCT TCG AGT TCG TCT TCT 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

TAT GGC TGG CAA GTA CCT ATT TGT GAA 
Tyr Gly Trp Gln Val Pro Ile Cys Glu 

GAA AAT GGC CAA ACA TGT GTC GGC CCC 
Glu Asn Gly Gln Thr Cys Val Gly Pro 

TAA 2627 
END 

Bankia gouldi mix (Clone 

1 

ATG AAG ATG ACC TAC ATG CAT CCG 
Met Lys Met Thr Tyr Met His Pro 

GCG GAT CAG TTG GTC AAC TGG GCG 
Ala Asp Gln Leu Val Asn Trp Ala 

CAC ACT CTG GTT TGG CAC TCC GAA 
His Thr Leu Val Trp His Ser Glu 

TAC TCT GGT GAT GCA ACT GCA TTC 
Tyr Ser Gly Asp Ala Thr Ala Phe 

ACT GTG GCT GAG CAT TTT GCT GGC 
Thr Val Ala Glu His Phe Ala Gly 

GAA GTG CTG GAG CCG GGC TCC AAT 
Glu Val Leu Glu Pro Gly Ser Asn 

TAC CAG AAG CTT GGC AAA GAC TTT 
Tyr Gln Lys Leu Gly Lys Asp Phe 

# 37GP3) Glycosidase 

GCT GAA GAT ACT TAC TCG TTT GGT CAA 
Ala Glu Asp Thr Tyr Ser Phe Gly Gln 

AAA GCG AAT GGT ATT GGC GTG CAC GGC 
Lys Ala Asn Gly Ile Gly Val His Gly 

TAC CAG GTA CCC AAT TGG ATG AAA AAT 
Tyr Gln Val Pro Asn Trp Met Lys Asn 

CAA ACC ATG CTC AAC ACC CAT GTG AAA 
Gln Thr Met Leu Asn Thr His Val Lys 

GAA CTG GAC AGC TGG GAC GTT GTG AAT 
Glu Leu Asp Ser Trp Asp Val Val Asn 

GGT TGC TGG CGT GAA AAC TCT CTG TTC 
Gly Cys Trp Arg Glu Asn Ser Leu Phe 

GTC GCG AAC GCA TTC CGT GCA GCT CGC 
Val Ala Asn Ala Phe Arg Ala Ala Arg 

GAG GGC GAT CCC AAT GCA GAC TTG 
Glu Gly Asp Pro Asn Ala Asp Leu 

GGT GTA ACT TCC GAT GAG AAG TTC 
Gly Val Thr Ser Asp Glu Lys Phe 

CTT CTG GAA GCG GAC GTG CCG ATT 
Leu Leu Glu Ala Asp Val Pro Ile 

CAG GCG ACG TGG CCT AGC AAT GCC 
Gln Ala Thr Trp Pro Ser Asn Ala 

GCG GAT CGC GGT CTG AAA GTT AAA 
Ala Asp Arg Gly Leu Lys Val Lys 

AAC CCT TAC GGA ACC ACT AAT TTC 
Asn Pro Tyr Gly Thr Thr Asn Phe 

GCC GCC GAG CTG CAG AAG CAG CGC 
Ala Ala Glu Leu Gln Lys Gln Arg 

GAT AAC GTA CCG GCC AAC CTG CGT 
Asp Asn Val Pro Ala Asn Leu Arg 

TAT TAC AAC GAT TAC TCG ACT GAA AAT 
Tyr Tyr Asn Asp Tyr Ser Thr Glu Asn 

AGT TGT TTG TTG GAA CTA GTC GAT GAG 
Ser Cys Leu Leu Glu Leu Val Asp Glu 

ACA GGT GTT GGT TTC CAA ATG CAC GTG 
Thr Gly Val Gly Phe Gln Met His Val 

AAC ATC GGC AAG GCA TTC AAA GCC ATC 
Asn Ile Gly Lys Ala Phe Lys Ala Ile 

ATT TCT GAG CTC GAT GTT CCT GTT AAC 
Ile Ser Glu Leu Asp Val Pro Val Asn 

CCG CAA TAC AGC AGT TTT ACC GCG GAA 
Pro Gln Tyr Ser Ser Phe Thr Ala Glu 

TAC AAG GGC ATT ATG CAA GCG TAC CTT 
Tyr Lys Gly Ile Met Gln Ala Tyr Leu 

GGT GGT TTC ACC GTG TGG GGC GTT TGG 
Gly Gly Phe Thr Val Trp Gly Val Trp 

GAT GGC GAT AGC TGG ATC ATG ACG 
Asp Gly Asp Ser Trp Ile Met Thr 

AAC GAC TGG CCA CTG TTG TTC ACC 
Asn Asp Trp Pro Leu Leu Phe Thr 

TTC AGC CAG TAC ACC AAC GCT AAC GCC 
Phe Ser Gln Tyr Thr Asn Ala Asn Ala 

GGG CCG TAA 848 
Gly Pro END 

Teredinibacter, pure (Clone 42GP1) Glycosidase 

1 

ATG GGA ACA TCT CTT ATG ATC AAA TCT ACA CTG ACA GGT ATG ATT ACT GCT 
Met Gly Thr Ser Leu Met Ile Lys Ser Thr Leu Thr Gly Met Ile Thr Ala 

GTT GCC GCC GCA GTT TTC ACC ACC TCT GCA GCT TTC GCG GAT GTA CCT CCG 
Val Ala Ala Ala Val Phe Thr Thr Ser Ala Ala Phe Ala Asp Val Pro Pro 

TTG ACA GTG AGC GGA AAT CAG GTT TTA AGT GGC GGT GAA GCA AAA AGC TTC 
Leu Thr Val Ser Gly Asn Gln Val Leu Ser Gly Gly Glu Ala Lys Ser Phe 

GCT GGT AAC AGC TTC TTT TGG AGC AAT ACC GGA TGG GGC CAG GAA CGT TTT 
Ala Gly Asn Ser Phe Phe Trp Ser Asn Thr Gly Trp Gly Gln Glu Arg Phe 

TAC AAC GCA GAA ACT GTG CGT TGG TTG AAA GAC GAC TGG AAC GCA ACC ATT 
Tyr Asn Ala Glu Thr Val Arg Trp Leu Lys Asp Asp Trp Asn Ala Thr Ile 

GTC CGC GCC GCT ATG GGC GTA GAC TTT GAT GGC AGC TAT ATC CCC GAG CAT 
Val Arg Ala Ala Met Gly Val Asp Phe Asp Gly Ser Tyr Ile Pro Glu His 



GAA GAC GCC GAC CCC GAG GGT AAC GTC GCT CGC 
Glu Asp Ala Asp Pro Glu Gly Asn Val Ala Arg 



GTA CGT 



Val Arg 



GCA TTG GTG GAT 
Ala Leu Val Asp 

GCA GCC ATC GCA GAA GAC ATG TAC 
Ala Ala Ile Ala Glu Asp Met Tyr 

GCA GAA GAT TAC CAA GCC GAA TCT 
Ala Glu Asp Tyr Gln Ala Glu Ser 

CTG TAC GGT GGG TAC GAC AAT GTT 
Leu Tyr Gly Gly Tyr Asp Asn Val 

CAA ATC AGC TGG GAC AAT GTT ATT 
Gln Ile Ser Trp Asp Asn Val Ile 

GCT ATC CGC GCA ATC GAC CCG GAC 
Ala Ile Arg Ala Ile Asp Pro Asp 

TGG TCA CAG GAC GTG GAC GCC GCT 
Trp Ser Gln Asp Val Asp Ala Ala 

AAT ATT GCG TAC ACC CTG CAC TTT 
Asn Ile Ala Tyr Thr Leu His Phe 

CGC GAT AAA GCG CGT AAC GCT ATG 
Arg Asp Lys Ala Arg Asn Ala Met 

GTG ATT ATC GAT TTT CAC ACT CAC CAC 
Val Ile Ile Asp Phe His Thr His His 

ATC GAG TTC TTC GAA GAA ATG GCC ACA 
Ile Glu Phe Phe Glu Glu Met Ala Thr 

ATT TAT GAA ATC TAT AAC GAG CCC CTG 
Ile Tyr Glu Ile Tyr Asn Glu Pro Leu 

AAA CCT TAT GCA GAA TCG GTG ATT GGC 
Lys Pro Tyr Ala Glu Ser Val Ile Gly 

AAC CTG ATT ATC GTC GGC ACG CCC ACT 
Asn Leu Ile Ile Val Gly Thr Pro Thr 

GCG CGC AAT CCA ATC ACC AGC TAC AGC 
Ala Arg Asn Pro Ile Thr Ser Tyr Ser 

TAC GCA GGC ACT CAC GGT TCA TGG TTG 
Tyr Ala Gly Thr His Gly Ser Trp Leu 

AAC AGT GGT ATT GCG CTG TTT GTG ACT 
Asn Ser Gly Ile Ala Leu Phe Val Thr 

GAG TGG GGC ACC GTT 
Glu Trp Gly Thr Val 

ACT CAG CAA TGG ATG 
Thr Gln Gln Trp Met 

TGG TCC GTG AGT GAT 
Trp Ser Val Ser Asp 

CCC ATT AGC GGC TGG 
Pro Ile Ser Gly Trp 

AAG AAC ATC GTT TCC 
Lys Asn Ile Val Ser 

AGT TCA TCC AGC TCC 
Ser Ser Ser Ser Ser 

TCC TCC TCC AGC AGC 
Ser Ser Ser Ser Ser 

AAT GCA GAT GGC GAT GGT 
Asn Ala Asp Gly Asp Gly 

GAC TTC CTC AAG CAG AAC 
Asp Phe Leu Lys Gln Asn

AAA TTG GAA GGT GCG TCT 
Lys Leu Glu Gly Ala Ser 

AAC GCT TCT GAC CTT ACG 
Asn Ala Ser Asp Leu Thr 

AAC TGG GGC ACC ACA ATC 
Asn Trp Gly Thr Thr Ile

TCT TCC AGC TCT TCA AGC 
Ser Ser Ser Ser Ser Ser 

TCT TCC TCG TCA AGC AGC 
Ser Ser Ser Ser Ser Ser
GCG CCT GCA GTT AAC GAA
Ala Pro Ala Val Asn Glu 

AAT ATC TCT CAC TTG AAC 
Asn Ile Ser His Leu Asn

ATC GTA CAA CCT GGC ACG 
Ile Val Gln Pro Gly Thr

GCC TCC GGC ACA CTG GTT 
Ala Ser Gly Thr Leu Val 

GGT AAC GGC AGC TCC TCA 
Gly Asn Gly Ser Ser Ser 

AGT TCT TCT TCG AGC AGT 
Ser Ser Ser Ser Ser Ser 

TCC GGA TCA ACT GGT GGC 
Ser Gly Ser Thr Gly Gly 



GGC AAC TGT GCT GGA GTG AAT GTG TAC CCG AAC TGG ACC GCG CGT GAC TGG 
Gly Asn Cys Ala Gly Val Asn Val

TCT GGC GGC GCC TAC AAC CAT GCG 
Ser Gly Gly Ala Tyr Asn His Ala 

AAC AGC CTG TAT CGT GCC AAC TGG 
Asn Ser Leu Tyr Arg Ala Asn Trp 

GCC TCC TGG ACT AGC CTT GGC GCC 
Ala Ser Trp Thr Ser Leu Gly Ala 

TCC AGC TCA AGC AGC TCC TCG TCA 
Ser Ser Ser Ser Ser Ser Ser Ser 

TCG TCT ACT GGC GGT GGC TCC AGC 
Ser Ser Thr Gly Gly Gly Ser Ser 

TCG TCT TCC AGC AGC TCT AGC AGC 
Ser Ser Ser Ser Ser Ser Ser Ser 

TGC AAC TGG TAC GGT CAG GGA ACC 
Cys Asn Trp Tyr Gly Gln Gly Thr
Tyr Pro Asn Trp Thr Ala Arg Asp Trp

AAC GCT GGC GAC CAA ATG GTC TAT CAA 
Asn Ala Gly Asp Gln Met Val Tyr Gln

TAC ACC AAC AGC GTG CCT GGC AGC GAC 
Tyr Thr Asn Ser Val Pro Gly Ser Asp 

TGC GGA GGC AAC GGA AGT ACG ACC TCA 
Cys Gly Gly Asn Gly Ser Thr Thr Ser 

AGC AGC AGC TCT TCT TCC AGC AGC TCC 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

TCC TCC AGC AGT TCA TCT TCT TCA TCG 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

ACT GGT GGC GGT CAA TGT ACC GAA GTG 
Thr Gly Gly Gly Gln Cys Thr Glu Val

TAC CCA CTG TGT AAC AAC ACC AGT GGT 
Tyr Pro Leu Cys Asn Asn Thr Ser Gly 




TGG GGT TGG GAA AAC AAT CAG AGC 
Trp Gly Trp Glu Asn Asn Gln Ser

CAG AAC GGT GGC GCT GGC GGC GTG 
Gln Asn Gly Gly Ala Gly Gly Val

TCC AGC AGC TCC TCT TCC AGC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser 

TCA TCC AGC AGC TCT TCA TCT GGC 
Ser Ser Ser Ser Ser Ser Ser Gly 

AGC TCT TCC AGC AGC TCC AGC TCA 
Ser Ser Ser Ser Ser Ser Ser Ser 

CCA CGC GTG GAC AAC CCC TTC GCC 
Pro Arg Val Asp Asn Pro Phe Ala 

ATG TGG TCA GCG AGT GCT GCA AAC 
Met Trp Ser Ala Ser Ala Ala Asn 

GAA CCC TCG TTT GTA TGG ATG GAC 
Glu Pro Ser Phe Val Trp Met Asp 
TGT ATC GGC CGT CAA ACC TGT GAG TCA
Cys Ile Gly Arg Gln Thr Cys Glu Ser

GTG AGC AAC TGC ACC GGT TCG AGT ACA 
Val Ser Asn Cys Thr Gly Ser Ser Thr 

AGT TCT TCC TCA AGT AGC AGC TCC AGT 
Ser Ser Ser Ser Ser Ser Ser Ser Ser 

ACT GGT AGC AGT ACA TCT TCC AGC AGC 
Thr Gly Ser Ser Thr Ser Ser Ser Ser 

AGT ACC GGT TCC TCC GGT ATG CCT GGA 
Ser Thr Gly Ser Ser Gly Met Pro Gly 

GCT GCG CAG AAG TGG TAC ATA AAC CCA 
Ala Ala Gln Lys Trp Tyr Ile Asn Pro

GAA CCC GGC GGC TCT GTC ATT GCC AAC 
Glu Pro Gly Gly Ser Val Ile Ala Asn

CGT ATC GGC GCA ATC GAA GGG CCT GCT 
Arg Ile Gly Ala Ile Glu Gly Pro Ala
GAC GGT ATG GGC CTG CGC GAC CAC
Asp Gly Met Gly Leu Arg Asp His 

GAC CTG TTC ATG TTT GTT GTG TAC 
Asp Leu Phe Met Phe Val Val Tyr 

CTC GCC TCC AAC GGT GAA CTG CGC 
Leu Ala Ser Asn Gly Glu Leu Arg 

AAG TCC GAC TAC ATC GCA CCT ATC 
Lys Ser Asp Tyr Ile Ala Pro Ile

GCA GGT ATC AAA ATC GCT GCG GTT 
Ala Gly Ile Lys Ile Ala Ala Val

GTT ACC AAT CTG AGC GAA CCT GAC 
Val Thr Asn Leu Ser Glu Pro Asp 

TAC CGC GAC GGC ATT CGT CAC GCT 
Tyr Arg Asp Gly Ile Arg His Ala

GTA TAC TCC TAC GTG GAT ATT GCA 



TTG AAC GAA GCC CTT GCA CAA GGC GCC 
Leu Asn Glu Ala Leu Ala Gln Gly Ala

GAC CTG CCA AAC CGT GAC TGT GCT GCA 
Asp Leu Pro Asn Arg Asp Cys Ala Ala 

ATC TCC GAA GAT GGC TTC AAC ATC TAC 
Ile Ser Glu Asp Gly Phe Asn Ile Tyr

GTT GAA ATC ATC AGC GAC CCT GCA TAC 
Val Glu Ile Ile Ser Asp Pro Ala Tyr

ATC GAG GTG GAC TCA CTG CCT AAC CTG 
Ile Glu Val Asp Ser Leu Pro Asn Leu

TGT CAG GAA GCA AAT GGT CCT GGC GGC 
Cys Gln Glu Ala Asn Gly Pro Gly Gly

ATC ACT GAA CTG GGC AAA ATC CCC AAC 
Ile Thr Glu Leu Gly Lys Ile Pro Asn

CAC TCA GGC TGG CTG GGC TGG AAC GAC 
Val Tyr Ser Tyr Val Asp Ile Ala

AAC TTC GCG CAA GGC GTT AAC CTG 
Asn Phe Ala Gln Gly Val Asn Leu

TCC GGC ATT AAC CCA ATC GCC GGT 
Ser Gly Ile Asn Pro Ile Ala Gly

CCT GTG GAA GAA CCC TTC TTG CCA 
Pro Val Glu Glu Pro Phe Leu Pro 

CCC GTT CGC TCT TCC GAT TTC TAT 
Pro Val Arg Ser Ser Asp Phe Tyr 

CCC TTC GTG ACC GAT TGG CGT TCT 
Pro Phe Val Thr Asp Trp Arg Ser 

TCC ATC GGT ATG CTG ATC GAT ACC 
Ser Ile Gly Met Leu Ile Asp Thr

CGT CCA ACT GCG CAG TCT ACC TCC 
Arg Pro Thr Ala Gln Ser Thr Ser
His Ser Gly Trp Leu Gly Trp Asn Asp

ATT TAT GAA GTG GTT GCC AAC CTC GGT 

Ile Tyr Glu Val Val Ala Asn Leu Gly

TTC GTC AGT AAC TCC GCT AAC TAC ACG 

Phe Val Ser Asn Ser Ala Asn Tyr Thr 

GAC GCC AAC CTG CAG GTC GGT GGT CAG 

Asp Ala Asn Leu Gln Val Gly Gly Gln

GAG TGG AAC AGC TAC CTG GCA GAG AAA 

Glu Trp Asn Ser Tyr Leu Ala Glu Lys 

GCC ATG ATC TCG AAA GGT ATG CCA AGC 

Ala Met Ile Ser Lys Gly Met Pro Ser

GCA CGT AAC GGC TGG GGT GGC CCT GAG 

Ala Arg Asn Gly Trp Gly Gly Pro Glu 

AAC AAC CTG AAC ACC TTC GTT AAC GAA 

Asn Asn Leu Asn Thr Phe Val Asn Glu 
TCA CGT ATC GAC CGT CGT GAG CAC
Ser Arg Ile Asp Arg Arg Glu His

GGT GTC GGC TAC CGT CCA ACC GCT 
Gly Val Gly Tyr Arg Pro Thr Ala 

GTT TGG GTG AAA CCA CAG GGT GAG 
Val Trp Val Lys Pro Gln Gly Glu

GAG ATC GAT CCT AAC GAC CCG AAC 
Glu Ile Asp Pro Asn Asp Pro Asn

TTC GCC AGC AAC TCG TCC AAC AGT 
Phe Ala Ser Asn Ser Ser Asn Ser 

GCT CCG CAC GCT GGT CGC TGG TTC 
Ala Pro His Ala Gly Arg Trp Phe 

AAC GCT TAC CCA CCA ATT AAC TAA 
Asn Ala Tyr Pro Pro Ile Asn END
CGC GGC AAC TGG TGT AAC CAG CCT GGT
Arg Gly Asn Trp Cys Asn Gln Pro Gly

GCA CCT TCT CCA GGT ATT GAT GCC TAC 
Ala Pro Ser Pro Gly Ile Asp Ala Tyr

TCT GAC GGT GTT TCC GAT CCT AAC TTC 
Ser Asp Gly Val Ser Asp Pro Asn Phe 

AAA CAG CAC GAC CCA ATG TGT GAT CCG 
Lys Gln His Asp Pro Met Cys Asp Pro

GCA TAC GGC ACC GGC GCT ATG CCA AAT 
Ala Tyr Gly Thr Gly Ala Met Pro Asn 

CCT GAA GCC TTC CAG TTA CTG CTT GAA 
Pro Glu Ala Phe Gln Leu Leu Leu Glu

3032 
Microscilla furvescens (Clone # 53GC1)

1 

ATG AAC AAG AAG TGG TGG AAA GAA GCC GTG GTG TAT CAA GTC TAC CCG CGG 
Met Asn Lys Lys Trp Trp Lys Glu Ala Val Val Tyr Gln Val Tyr Pro Arg



AGC TTC AAA GAC AGC AAT GGA GAT GGT GTA GGC GAT CTG CCT GGG GTT ATT 
Ser Phe Lys Asp Ser Asn Gly Asp Gly Val Gly Asp Leu Pro Gly Val Ile



GAA AAG CTT GAT TAC ATC AAA AGC CTT GGG GTG GAT GTT ATC TGG CTA TGC 

Glu Lys Leu Asp Tyr Ile Lys Ser Leu Gly Val Asp Val Ile Trp Leu Cys

CCG GTG TAC GAT TCC CCC AAT GAT GAC AAT GGT TAC GAT ATT CGT GAC TAC 

Pro Val Tyr Asp Ser Pro Asn Asp Asp Asn Gly Tyr Asp Ile Arg Asp Tyr



TAC GAT ATC ATG GCT GAT TTC GGC ACG ATG GCT GAT TTT GAT CAG CTG CTC 
Tyr Asp Ile Met Ala Asp Phe Gly Thr Met Ala Asp Phe Asp Gln Leu Leu



GAG GGA ATA CAT CAG CGT GGG ATG 
Glu Gly Ile His Gln Arg Gly Met

CAC TGC TCT GAT GAG CAC AAA TGG 
His Cys Ser Asp Glu His Lys Trp 



AAA CTG CTA ATG GAC CTG GTG GTA AAC 
Lys Leu Leu Met Asp Leu Val Val Asn 

TTT CAG GAG TCC CGC AAG AGT AAA GAC 
Phe Gln Glu Ser Arg Lys Ser Lys Asp
AAC CCT TAC CGG GAC TAC TTC ATC
Asn Pro Tyr Arg Asp Tyr Phe Ile

CCT AAC AAC TGG CAG TCC TTT TTT 
Pro Asn Asn Trp Gln Ser Phe Phe

GCC ACT GAC GAG TAT TAC CTA CAT 
Ala Thr Asp Glu Tyr Tyr Leu His 

AAT TGG GAA AAC CCG AAA GTA CGT 
Asn Trp Glu Asn Pro Lys Val Arg 

TGG CTG GAC AAA GGA GTA GAT GGG 
Trp Leu Asp Lys Gly Val Asp Gly 

TCA AAA AGA AAC TTC GAA GAT TCA 
Ser Lys Arg Asn Phe Glu Asp Ser 

GAT AAC GTC TAC GCC AAT GGC CCG 
Asp Asn Val Tyr Ala Asn Gly Pro 

AAC CGT GAA GTA CTG AGT AAG TAC 
Asn Arg Glu Val Leu Ser Lys Tyr 
TGG AAG CCT GGC AAA AAC GGA GGC CCA
Trp Lys Pro Gly Lys Asn Gly Gly Pro 

AGT GGT AAT GCC TGG GAA TAC GAT GAG 
Ser Gly Asn Ala Trp Glu Tyr Asp Glu 

CTT TTC ACC AAA AAG CAA CCA GAC CTC 
Leu Phe Thr Lys Lys Gln Pro Asp Leu

GAG GAG GTG CAC AAG CTG ATG AAG TAT 
Glu Glu Val His Lys Leu Met Lys Tyr 

TTC CGG ATG GAT GTG ATT TCC GTG ATT 
Phe Arg Met Asp Val Ile Ser Val Ile

CCT TAC AAG GAC TTC AAC AAG ACC ATC 
Pro Tyr Lys Asp Phe Asn Lys Thr Ile

CGT GTG CAG GAG TTT CTC CAG GAA ATG 
Arg Val Gln Glu Phe Leu Gln Glu Met

GAT GTG ATG ACA GTA GGT GAG GGT CCA 
Asp Val Met Thr Val Gly Glu Gly Pro 
GGT ATC AAT CTG GAA AGC GGC CTG
Gly Ile Asn Leu Glu Ser Gly Leu

CTT AAT ATG ATT TTT CAT TTT GGG 
Leu Asn Met Ile Phe His Phe Gly

GGT AGA TTT GAT CCC AAG CCC ATC 
Gly Arg Phe Asp Pro Lys Pro Ile

AGG CTG TGG GAT GAG TAC CTT AAA 
Arg Leu Trp Asp Glu Tyr Leu Lys 

GGG AAT CAT GAT TTT CAG CGA ATC 
Gly Asn His Asp Phe Gln Arg Ile

TAC TGG AAA GAG TCC GCC AAA CTG 
Tyr Trp Lys Glu Ser Ala Lys Leu 

GGC ACG GTC TAC GTT TAC CAG GGT 
Gly Thr Val Tyr Val Tyr Gln Gly
CAA TAT GTA TCC AGC TCA GCG GAG GCT

Gln Tyr Val Ser Ser Ser Ala Glu Ala

CAC ATG TTT ATG GAT CAT GGA CCC GGA 

His Met Phe Met Asp His Gly Pro Gly 

GAT TTT CTG GAA TTC AAA AAA GTC TTC 

Asp Phe Leu Glu Phe Lys Lys Val Phe 

GAA GAG GGC TGG GGT AGC GTC TTT CTA 

Glu Glu Gly Trp Gly Ser Val Phe Leu 

GTT TCT CGC TTT GGG GAT GAC GGA GCG 

Val Ser Arg Phe Gly Asp Asp Gly Ala

CTG AGC TTG TTG CTA TTT AGC ATG CGC 

Leu Ser Leu Leu Leu Phe Ser Met Arg 

GAT GAA ATA GGT ATG ACC AAT GTG GCT 

Asp Glu Ile Gly Met Thr Asn Val Ala



TTT GAC ACC ATA GAA GAA TAT GAC GAT GTG GAG ATC AAA AAT GCT TAC AAG 
Phe Asp Thr Ile Glu Glu Tyr Asp Asp Val Glu Ile Lys Asn Ala Tyr Lys

GAG TGG AAA GCT GAA GGA AAA GAC CTG GAT CAG TTT TTA AAG AAC GTC CAT 
Glu Trp Lys Ala Glu Gly Lys Asp Leu Asp Gln Phe Leu Lys Asn Val His

ATC AAT GGC CGT GAC AAT GCC CGT ACA CCG CTG CAA TGG AAT GAT GCT GAG 
Ile Asn Gly Arg Asp Asn Ala Arg Thr Pro Leu Gln Trp Asn Asp Ala Glu

CAG GCT GGT TTT ACC TCA GGC ACT CCA TGG CTC AAA GTC AAC CCT AAC TAT 
Gln Ala Gly Phe Thr Ser Gly Thr Pro Trp Leu Lys Val Asn Pro Asn Tyr

ACG GCA ATC AAT GTG GCT AGT CAG GAA GGA GAT GAG AAC TCT ATT CTG GCA
Thr Ala Ile Asn Val Ala Ser Gln Glu Gly Asp Glu Asn Ser Ile Leu Ala

TTT TAT CGC CGG ATG GTG GCG ATG CGA AAG GAG CAC CCG ACA CTT GTT TAT 
Phe Tyr Arg Arg Met Val Ala Met Arg Lys Glu His Pro Thr Leu Val Tyr 

GGT GAT TTT GCC CCC ATT CAG GAA GAT CAT CCG AGT GTA TTT GCT TTT TGG 
Gly Asp Phe Ala Pro Ile Gln Glu Asp His Pro Ser Val Phe Ala Phe Trp



AGA TGG GAT GAA GAG GCT GCA TAT TTA GTC TTA CTC AAT TTT TCT GAG GAG 
Arg Trp Asp Glu Glu Ala Ala Tyr Leu Val Leu Leu Asn Phe Ser Glu Glu 
ACT CAG GAA TTT GGG
Thr Gln Glu Phe Gly

GTA GAG GCC AAT GAC 
Val Glu Ala Asn Asp 

CTA AAA CCG TGG CAG 
Leu Lys Pro Trp Gln
CTG GAC GAT CGA TTT GAT
Leu Asp Asp Arg Phe Asp

TTT GAC TTT GGT GAG CCA 
Phe Asp Phe Gly Glu Pro 

GCG GTG TTG GCG CGT GTT 
Ala Val Leu Ala Arg Val 
AGT AGT AAG CTT CGC ATA
Ser Ser Lys Leu Arg Ile

CAA AGT GGA AAA GTG AAA 
Gln Ser Gly Lys Val Lys

1582

CGG CAT ATT GAA TTG TAA 
Arg His Ile Glu Leu END
Thermotoga neapolitana (Clone # 56GC2) Glycosidase

1 

TCT TCT GAA CGA TTC TCC ACT GAG CAG AAA AGA CCA GAT CAT ACT CTT TGT 
Ser Ser Glu Arg Phe Ser Thr Glu Gln Lys Arg Pro Asp His Thr Leu Cys

GGA CGG AAA AGA ACA TTC GGC AAA GAA GGT GGT TAT ACC ACC CTT CAA AGA 
Gly Arg Lys Arg Thr Phe Gly Lys Glu Gly Gly Tyr Thr Thr Leu Gln Arg

GGA AAC GCT GGT CTT CAA AGT GAA CGG ACT GAA GAG GGG AGA GCA CCT CGT 
Gly Asn Ala Gly Leu Gln Ser Glu Arg Thr Glu Glu Gly Arg Ala Pro Arg

ATC CAC CAG TCT GAA CAC GGG AAA AAC CAT CTA TGT GAG GTG ATC TGT GTG 
Ile His Gln Ser Glu His Gly Lys Asn His Leu Cys Glu Val Ile Cys Val

GAG ATC TTC AAA AGA CCG TTC AGA GAA GGG AGC TTC GTT CTG AAA GAG AAG 
Glu Ile Phe Lys Arg Pro Phe Arg Glu Gly Ser Phe Val Leu Lys Glu Lys

GAC TAC ACC GTT GAG TTC GAG GTG GAG AAG ATC CAT CTT GGA TGG AAG ATT 
Asp Tyr Thr Val Glu Phe Glu Val Glu Lys Ile His Leu Gly Trp Lys Ile

TCA GGG AGA GTG AAG GGA AAT CCC GGA AGG CTT GAG ATC TTT CGG ACA AAC 
Ser Gly Arg Val Lys Gly Asn Pro Gly Arg Leu Glu Ile Phe Arg Thr Asn
GCA CCG AAG AAA CTC CTC GTG AAC AAC TGG CAG TCC TGG GGA CCC TGC AGG

Ala Pro Lys Lys Leu Leu Val Asn Asn Trp Gln Ser Trp Gly Pro Cys Arg

GTG GTG GAT CTT CCA TCC TTC ACC CCA CCC GAG ATA GAT CCA AAC TGG CAG 
Val Val Asp Leu Pro Ser Phe Thr Pro Pro Glu Ile Asp Pro Asn Trp Gln

TAC ACG GCC TCT GTG GTA CCG GAT GTG ATC AAA AAC CGT CTT CAG AGT GAC 
Tyr Thr Ala Ser Val Val Pro Asp Val Ile Lys Asn Arg Leu Gln Ser Asp

TAC TTC GTG GCA GAG GAA GGG AGA GTA TAC GGT TTT TTG AGT TCG AAG ATC 
Tyr Phe Val Ala Glu Glu Gly Arg Val Tyr Gly Phe Leu Ser Ser Lys Ile

GCA CAT CCT TTC TTT GCG GCA GAG AAT GGA GAA CTT GTT GCG TAT CTT GAG 
Ala His Pro Phe Phe Ala Ala Glu Asn Gly Glu Leu Val Ala Tyr Leu Glu 

TAC TTC GAT GTG AAT TTC GAT GAC TTC GTC CCG ATA GAA CCT TTT GTC GTC 
Tyr Phe Asp Val Asn Phe Asp Asp Phe Val Pro Ile Glu Pro Phe Val Val

CTT GAA AAT CCA ATC ACC TCT CTC CTT CTG GAA AAG TAC GCT GAA CTC GTC 
Leu Glu Asn Pro Ile Thr Ser Leu Leu Leu Glu Lys Tyr Ala Glu Leu Val

GGG AAG GAA AAC AGC GCG AGG ATT CCA AAA CGT ACA CCG GTT GGA TGG TGC 
Gly Lys Glu Asn Ser Ala Arg Ile Pro Lys Arg Thr Pro Val Gly Trp Cys
AGC TGG TAC CAC TAT TTC CTC GAT
Ser Trp Tyr His Tyr Phe Leu Asp 

CTG GAA CTT GCA GGA GAG TTT CCC 
Leu Glu Leu Ala Gly Glu Phe Pro 

TAT GAA AAA GAC ATC GGA GAC TGG 
Tyr Glu Lys Asp Ile Gly Asp Trp

GTG GAC GAG ATG GCA AGG ACG ATA 
Val Asp Glu Met Ala Arg Thr Ile

TGG ACC GCA CCG TTC AGT GTT TCA 
Trp Thr Ala Pro Phe Ser Val Ser 

CCG GAC TGG GTC GTG AAG GAA AAC 
Pro Asp Trp Val Val Lys Glu Asn 

TGG AAC AGA AAG ATC TAC GCT CTT 
Trp Asn Arg Lys Ile Tyr Ala Leu

TGG CTC TTC GAC CTC TTC AGC TCT 
CTC ACC TGG GAG GAG ACT TTG AAG AAT
Leu Thr Trp Glu Glu Thr Leu Lys Asn

TTC GAG GTC TTT CAG ATA GAC GAC GCG 
Phe Glu Val Phe Gln Ile Asp Asp Ala

CTC GTC ACG AAG AAA GAC TTC CCA TCT 
Leu Val Thr Lys Lys Asp Phe Pro Ser 

CAG GAG AAA GGC TTT GTT CCT GGT ATA 
Gln Glu Lys Gly Phe Val Pro Gly Ile

GAA ACA TCG GAT GTG TTC AAC TCC TAT 
Glu Thr Ser Asp Val Phe Asn Ser Tyr 

GGA ATG CCA AAG ATG GCG TAC AGG AAC 
Gly Met Pro Lys Met Ala Tyr Arg Asn 

GAC CTT TCA AAC AAA GAA GTC CTG GAC 
Asp Leu Ser Asn Lys Glu Val Leu Asp 

CTC AAG AAG ATG GGC TAC AGA TAC TTC 
Trp Leu Phe Asp Leu Phe Ser Ser

AAG ATC GAC TTT CTC TTT GCA GGA 
Lys Ile Asp Phe Leu Phe Ala Gly

ATC ACA CCC GTT CAG GCG TTC AGA 
Ile Thr Pro Val Gln Ala Phe Arg

GTT GGA GAC TTG TTC ATA CTC GGA 
Val Gly Asp Leu Phe Ile Leu Gly

GGC TAC GTT GAC GGC ATG AGG ATA 
Gly Tyr Val Asp Gly Met Arg Ile

GAT CAA ATA GAA GAC AAC GGA GCA 
Asp Gln Ile Glu Asp Asn Gly Ala

GCC ATC ACA CGT TAC TTC ATG CAC 
Ala Ile Thr Arg Tyr Phe Met His

TGC CTC ATC CTG AGA GAG GAA AAA 
Cys Leu Ile Leu Arg Glu Glu Lys
Leu Lys Lys Met Gly Tyr Arg Tyr Phe

GCG ATT CCG GGT GAG AGG AAA GAA AAC 
Ala Ile Pro Gly Glu Arg Lys Glu Asn

AAG GGG ATG GAG GTG ATC AGA AAG GCG 
Lys Gly Met Glu Val Ile Arg Lys Ala

TGT GGC TCT CCC CTT CTT CCT GCG GTG 
Cys Gly Ser Pro Leu Leu Pro Ala Val 

GGG CCG GAC ACC ACA CCC TTC TGG GGT 
Gly Pro Asp Thr Thr Pro Phe Trp Gly 

CCC GCT GCA AGA TGG GCT CTG AGA AAT 
Pro Ala Ala Arg Trp Ala Leu Arg Asn 

GAC AGA CTC TGG CTG AAC GAT CCG GAC 
Asp Arg Leu Trp Leu Asn Asp Pro Asp 

ACA GAA CTG ACC CCA AAA GAG AGA GAG 
Thr Glu Leu Thr Pro Lys Glu Arg Glu 
CTC TAC TCG TAC ACC
Leu Tyr Ser Tyr Thr 

GAC CTG TCA CTT GTG 
Asp Leu Ser Leu Val 

GAT CTT CTC GGG GGA 
Asp Leu Leu Gly Gly 

AAG TAC GAG ATC GTC 
Lys Tyr Glu Ile Val

GTC GAT CTC AAA AAC 
Val Asp Leu Lys Asn 

CTG AGA AAG AAG GTT 
Leu Arg Lys Lys Val 
TGT GGG ATC CTC GAC AAC

Cys Gly Ile Leu Asp Asn

AAA GAG CAC GGA AGG AAG 
Lys Glu His Gly Arg Lys 

AAG CCC CGT GTT CTG AAC 
Lys Pro Arg Val Leu Asn 

TCG TCT GGC ACG ATC TCT 
Ser Ser Gly Thr Ile Ser

AGA GAG TAC CAT CTG GAA 
Arg Glu Tyr His Leu Glu 

GTC AAA AGA GAA GAC GGA 
Val Lys Arg Glu Asp Gly 
ATG ATC ATA GAA AGT GAC
Met Ile Ile Glu Ser Asp

GTT CTG AGA GAG ACA CTC 
Val Leu Arg Glu Thr Leu 

ATC ATG ACA GAG GAT CTG 
Ile Met Thr Glu Asp Leu

GGA AAC ACC AGG CTC GTT 
Gly Asn Thr Arg Leu Val 

AAA GAG GGA AAG TCC TCT 

Lys Glu Gly Lys Ser Ser 

AGA AAC TTC TAC TTC TAC 
Arg Asn Phe Tyr Phe Tyr 



GAA GAG GGT GAG AGA GAA TGA 1856 
Glu Glu Gly Glu Arg Glu END 
Thermotoga neapolitana (Clone # 56GP1) Glycosidase

1 

ATG AGA AAA CTT GTG TTC TCA TTT TTG ATT GTG ACA TTG CCC ATC GTC CTC 
Met Arg Lys Leu Val Phe Ser Phe Leu Ile Val Thr Leu Pro Ile Val Leu

TTT GCA AAC AGT GAT TTC GTG AAA GTG GAA AAC GGC AGG TTC ATA CTG AAC 
Phe Ala Asn Ser Asp Phe Val Lys Val Glu Asn Gly Arg Phe Ile Leu Asn

GGA GAA GAG TTC AGA TTC GTT GGA AGC AAC AAC TAC TAC ATG CAC TAC AAG 
Gly Glu Glu Phe Arg Phe Val Gly Ser Asn Asn Tyr Tyr Met His Tyr Lys 

AGC AAT CGA ATG ATA GAC AGT GTC CTT GAA AGT GCA AAA GCC ATG GGG GTG 
Ser Asn Arg Met Ile Asp Ser Val Leu Glu Ser Ala Lys Ala Met Gly Val

AAG GTG CTC AGA ATT TGG GGA TTC CTC GAT GGT GAG AGT TAC TGC CGT GAC 
Lys Val Leu Arg Ile Trp Gly Phe Leu Asp Gly Glu Ser Tyr Cys Arg Asp

AAG AAC ACC TAC ATG CAC CCC GCA CCG GGA GTA TTT GGA TTG CCA GAG GGT 
Lys Asn Thr Tyr Met His Pro Ala Pro Gly Val Phe Gly Leu Pro Glu Gly 

ACG AAC GCT CAG GAC GGT TTT GAA AGA CTC GAC TAC ACG GTA GCG AAA GCA 
Thr Asn Ala Gln Asp Gly Phe Glu Arg Leu Asp Tyr Thr Val Ala Lys Ala
AAA GAA CTG GGC ATA AAG CTC ATA
Lys Glu Leu Gly Ile Lys Leu Ile

TTC GGT GGA ATG AAT CAA TAC GTG 
Phe Gly Gly Met Asn Gln Tyr Val

GAC TTC TAC AGG AAC GAG AAG ATC 
Asp Phe Tyr Arg Asn Glu Lys Ile

TTC CTC ATA AAC AGG GTG AAC ACC 
Phe Leu Ile Asn Arg Val Asn Thr

CCC ACC ATC ATG GCA TGG GAA CTG 
Pro Thr Ile Met Ala Trp Glu Leu

AAG TCT GGT AAC ACA CTC GTT GAA 
Lys Ser Gly Asn Thr Leu Val Glu 

AAG AGT CTG GAT CCA AAC CAC CTG 
Lys Ser Leu Asp Pro Asn His Leu 

AAC AAC TAC GAA GGC TTC AGA CCT 
Asn Asn Tyr Glu Gly Phe Arg Pro 
ATC GTT CTT GTG AAC AAC TGG GAC GAC
Ile Val Leu Val Asn Asn Trp Asp Asp

AGA TGG TTT GGG GGC ATC CAT CAC GAT 
Arg Trp Phe Gly Gly Ile His His Asp

AAA GAA GAA TAC AAA AAG TAC GTG TCT 
Lys Glu Glu Tyr Lys Lys Tyr Val Ser 

TAC ACG GGT GTT CCT TAC AGG GAA GAG 
Tyr Thr Gly Val Pro Tyr Arg Glu Glu 

GCG AAC GAG CCC AGG TGT GAA ACG GAC 
Ala Asn Glu Pro Arg Cys Glu Thr Asp 

TGG GTA GAG GAG ATG AGT GCT TAC ATA 
Trp Val Glu Glu Met Ser Ala Tyr Ile

GTT GCC GTG GGA GAC GAG GGA TTC TTC 
Val Ala Val Gly Asp Glu Gly Phe Phe 

TAC GGT GGA GAG GCT GAG TGG GCC TAC 
Tyr Gly Gly Glu Ala Glu Trp Ala Tyr 
AAC GGA TGG TCC GGT GTT GAC TGG
Asn Gly Trp Ser Gly Val Asp Trp 

GAT TTT GGT ACG TTC CAT CTC TAC 
Asp Phe Gly Thr Phe His Leu Tyr 

AAC TAC GCA CAG TGG GGG GCA AAG 
Asn Tyr Ala Gln Trp Gly Ala Lys

AAA GAG GTT GGA AAA CCC GTC GTT 
Lys Glu Val Gly Lys Pro Val Val 

GCC CCG GTC AAC AGG GTT GCC ATT 
Ala Pro Val Asn Arg Val Ala Ile

AAC CTC GGT GGA AAC GGT GCC ATG 
Asn Leu Gly Gly Asn Gly Ala Met 

GGA TGG GAC AGA GAC GAA AAG GGT 
Gly Trp Asp Arg Asp Glu Lys Gly 
AAG AGA CTT CTG GAG ATA GAG ACG GTG
Lys Arg Leu Leu Glu Ile Glu Thr Val

CCC TCC CAC TGG GGT GTG AGC CCT GAA 
Pro Ser His Trp Gly Val Ser Pro Glu 

TGG ATA GAA GAT CAC ATA AAG ATC GCA 
Trp Ile Glu Asp His Ile Lys Ile Ala

CTG GAA GAG TAC GGT ATT CCC AAA AGT 
Leu Glu Glu Tyr Gly Ile Pro Lys Ser

TAC AAA TTG TGG AAC GAT CTG GTC TAC 
Tyr Lys Leu Trp Asn Asp Leu Val Tyr 

TTC TGG ATG CTC GCA GGA ATC GGT GAA 
Phe Trp Met Leu Ala Gly Ile Gly Glu

TAC TAC CCC GAT TAC GAC GGC TTC AGA 
Tyr Tyr Pro Asp Tyr Asp Gly Phe Arg 



ATA GTG AAC GAT GAA AGT GAA GAG GCA AAG TTG ATC AGA GAG TAC GCG AAA 
Ile Val Asn Asp Glu Ser Glu Glu Ala Lys Leu Ile Arg Glu Tyr Ala Lys



CTG TTC AGC ACG GGT GAG GAT ACG 
Leu Phe Ser Thr Gly Glu Asp Thr 

CCA AAG GAT GGT CAG GAG ATC AAA 
Pro Lys Asp Gly Gln Glu Ile Lys

TTC GAC TAC AGC AAC ACG TTC AAA 
Phe Asp Tyr Ser Asn Thr Phe Lys 

CTC TTT GAA GAT GAG ATA AAA CAT 
Leu Phe Glu Asp Glu Ile Lys His

TTT GAC ACA ACG CGG ATT TCA GAC 
Phe Asp Thr Thr Arg Ile Ser Asp

CAT TTC AGG GGA GAA ACG GTG AAA
His Phe Arg Gly Glu Thr Val Lys 

AGA GCG CAG TAT GTA CTC GCA GAA 
Arg Ala Gln Tyr Val Leu Ala Glu



AGG GAA GAT ACC TGC ATG TTC ATC ACA 
Arg Glu Asp Thr Cys Met Phe Ile Thr

AAG ACT GTG AAG GTG AGA GTG GGT GTC 
Lys Thr Val Lys Val Arg Val Gly Val 

GGA ATT TCC GTC GGG GTT GAA AAT CTG 
Gly Ile Ser Val Gly Val Glu Asn Leu

CTC GGA TAT GGA GTT TAC GGA TTC GAA 
Leu Gly Tyr Gly Val Tyr Gly Phe Glu 

GGA GAA CAC GAG ATG TTC CTT GAG GCA 
Gly Glu His Glu Met Phe Leu Glu Ala 

GAC ACA ATC AGG GTG AAA GTT GTG AAC 
Asp Thr Ile Arg Val Lys Val Val Asn

GAA GTG GAT TTT TCC AGA CCC GAA GAA 
Glu Val Asp Phe Ser Arg Pro Glu Glu 
GTC AAG AAC TGG TGG AAC AGC GGA ACA TGG CAG GCT GAG TTC AAA ACA CCC

Val Lys Asn Trp Trp Asn Ser Gly Thr Trp Gln Ala Glu Phe Lys Thr Pro

GAT ATA GAG TGG AAC GGT GAG GTG GGG AAC GGT GCT CTC CAG ATG AAC GTG 

Asp Ile Glu Trp Asn Gly Glu Val Gly Asn Gly Ala Leu Gln Met Asn Val

GTG CTT CCC GGA AAG GGT GAC TGG GAA GAG GTG AGG GTG GTC AGG AAA TTC 

Val Leu Pro Gly Lys Gly Asp Trp Glu Glu Val Arg Val Val Arg Lys Phe 

GAT CAA CTC CCC GTG TGT GAG ATC CTC GAG TAC GAT ATC TAC ATA CCA GAC 

Asp Gln Leu Pro Val Cys Glu Ile Leu Glu Tyr Asp Ile Tyr Ile Pro Asp

GTT GAA GGG CTT ACA GGA AGG CTC AGA CCG TAC GCG GTG CTG AAT CCC GGC 

Val Glu Gly Leu Thr Gly Arg Leu Arg Pro Tyr Ala Val Leu Asn Pro Gly 

TGG GTG AAG ATA GGG CTC GAC ATG AAC AAC ACC TCG ATT GAC AGC GGA GAA 

Trp Val Lys Ile Gly Leu Asp Met Asn Asn Thr Ser Ile Asp Ser Gly Glu

CTT GTC AGT TTC GAT GGC AAA AAG TAC AGA AAG TTC CAT GTG AGG ATC GAG 

Leu Val Ser Phe Asp Gly Lys Lys Tyr Arg Lys Phe His Val Arg Ile Glu

TTC GAC AAG ACA CCT GGA GTG AAC GAG CTC CAC ATA GGT GTA GTT GGA GAC 

Phe Asp Lys Thr Pro Gly Val Asn Glu Leu His Ile Gly Val Val Gly Asp
CAC CTG GAG TAT GAT GGG CCG ATT
His Leu Glu Tyr Asp Gly Pro Ile
TTC ATC GAT AAT GTG AGG CTC TAT AAA
Phe Ile Asp Asn Val Arg Leu Tyr Lys



AAA TCT TCT TGA 2000 
Lys Ser Ser END 



WO 97/44361 PCT/US97/08793

48/121 



ARP 2.3 (Alum Rock sulfur spring, Clone # 58GB3) Glycosidase 
1 

ATG CAT TTT AGC CCA CTA CAA TTG ATC CTC GTC TTA GTC ATT GTC ATT CTG 
Met His Phe Ser Pro Leu Gln Leu Ile Leu Val Leu Val Ile Val Ile Leu



CTG TTT GGC ACC AAA AAA TTA CGC AAT ATG GGC GGC GAT TTA GGC GAA GCC 
Leu Phe Gly Thr Lys Lys Leu Arg Asn Met Gly Gly Asp Leu Gly Glu Ala 



TTC AAG AAT TTC AGA AAA GCA GTC AAA 

Phe Lys Asn Phe Arg Lys Ala Val Lys 

AAA GAT GTT GCT GTG CAA AAA GTT GAC

Lys Asp Val Ala Val Gln Lys Val Asp

CCA CAA GGT CGA GTC ATT GAT TCG GAA 

Pro Gln Gly Arg Val Ile Asp Ser Glu



GAC GGC GAT GAT GCT GAA ACA CAA 
Asp Gly Asp Asp Ala Glu Thr Gln

CAA CAG CCA CCA GCA CAG CCC ATC 
Gln Gln Pro Pro Ala Gln Pro Ile

254 

GCC AAG GAA AAG GAT AAG GTC TAA 
Ala Lys Glu Lys Asp Lys Val END 
AEPII 1a (Clone # 63GA3) Glycosidase

1 

ATG GAA GGA CTT CGA GGA GGT GTG AGG ATG AAG TTC CCA TCT AAC TTT CTT 
Met Glu Gly Leu Arg Gly Gly Val Arg Met Lys Phe Pro Ser Asn Phe Leu 

TTT GGC TAC TCC TGG TCG GGC TTC CAG TTT GAA ATG GGT TTA CCT GGG AGT 
Phe Gly Tyr Ser Trp Ser Gly Phe Gln Phe Glu Met Gly Leu Pro Gly Ser

GAA GTT GAG AGC GAC TGG TGG GCA TGG GTC CAC GAT AAG GAG AAC ATC TTC 
Glu Val Glu Ser Asp Trp Trp Ala Trp Val His Asp Lys Glu Asn Ile Phe

TCG GGC CTA GTT AGC GGT GAC CTA CCA GAG AAC GGG CCT GCT TAC TGG CAC 
Ser Gly Leu Val Ser Gly Asp Leu Pro Glu Asn Gly Pro Ala Tyr Trp His 

CTC TAC AAG AAA GAC CAC GAC ATA GCT GAA AGC CTT GGC ATG GAC GCG ATA 
Leu Tyr Lys Lys Asp His Asp Ile Ala Glu Ser Leu Gly Met Asp Ala Ile

AGA GGC GGA ATC GAG TGG GCG AGG ATC TTC CCA AAA CCC ACC TTT GAC GTG 
Arg Gly Gly Ile Glu Trp Ala Arg Ile Phe Pro Lys Pro Thr Phe Asp Val

AAG GTT GAC GTG GAA AAG GAC GAA AAC GGG AAC ATA ATC TCC ATT GAC GTC 
Lys Val Asp Val Glu Lys Asp Glu Asn Gly Asn Ile Ile Ser Ile Asp Val
CCG GAG AGC GCG ATA GAG GAG CTA
Pro Glu Ser Ala Ile Glu Glu Leu

AAC CAC TAC CGC GAA ATC TAC TCG 
Asn His Tyr Arg Glu Ile Tyr Ser

ATA TTG AAC CTC TAT CAC TGG CCC 
Ile Leu Asn Leu Tyr His Trp Pro

GGC GTT AGA AAG CTC GGC CCT GAT 
Gly Val Arg Lys Leu Gly Pro Asp 

AGG AGC GTG GTG GAG TTC ACC AAG 
Arg Ser Val Val Glu Phe Thr Lys 

GAT GAC CTC GTT GAC ATG TGG AGC 
Asp Asp Leu Val Asp Met Trp Ser 

GAG CAG GGT TAC ACG AGG CCT CAG 
Glu Gln Gly Tyr Thr Arg Pro Gln

CAC GAG GCC GCT GGA AAG GCG AAG 
His Glu Ala Ala Gly Lys Ala Lys 
GAA AAG CTT GCC AAC ATG GAT GCC CTC
Glu Lys Leu Ala Asn Met Asp Ala Leu 

GAC TGG AAG GAG AGG GGC AAG ACC TTC 
Asp Trp Lys Glu Arg Gly Lys Thr Phe 

CTT CCC CTC TGG CTC CAC GAC CCG ATA 
Leu Pro Leu Trp Leu His Asp Pro Ile

AGA GCT CCC TCG GGC TGG CTG GAC GAG 
Arg Ala Pro Ser Gly Trp Leu Asp Glu 

TTC GCT GCA TTC ATC GCC TAC CAC TTG 
Phe Ala Ala Phe Ile Ala Tyr His Leu

ACG ATG AAC GAG CCG AAT GTG GTT TAC 
Thr Met Asn Glu Pro Asn Val Val Tyr 

TCG GGC TTT CCA CCG GGT TAT CTC AGC 
Ser Gly Phe Pro Pro Gly Tyr Leu Ser 

CTC AAC CTC ATG CAG GCT CAC GCT AGA 
Leu Asn Leu Met Gln Ala His Ala Arg
GCT TAC GAT GCG ATA AAA GAG CAC TCG
Ala Tyr Asp Ala Ile Lys Glu His Ser

TCC TTT GTC TGG CAC GAT GCC CTA AAC 
Ser Phe Val Trp His Asp Ala Leu Asn 

GAG ATA AGG AGG AGA CAC TAC GAC TTC 
Glu Ile Arg Arg Arg His Tyr Asp Phe

TCG GAG TTC GGG GAG AGG GAG GAC TTC 
Ser Glu Phe Gly Glu Arg Glu Asp Phe 

GTG AAC TAC TAC ACT AGG GTT GCT TAC 
Val Asn Tyr Tyr Thr Arg Val Ala Tyr 

GCC CTA CCC GGG TAC GGC TAC ATG TGC 
Ala Leu Pro Gly Tyr Gly Tyr Met Cys 

GGA AGG CCC GCG AGC GAT TTT GGC TGG 
Gly Arg Pro Ala Ser Asp Phe Gly Trp 

AAC GTC CTG ATG GAT CTG AAG GAG CTC 



GAC AAG CCC GTG GGG TTG ATA TAC 
Asp Lys Pro Val Gly Leu Ile Tyr

GAG GAA GCG GAG GAG ATT GTG AAG 
Glu Glu Ala Glu Glu Ile Val Lys

GTA ACC GGC CTT CAC TCC GGC TCA 
Val Thr Gly Leu His Ser Gly Ser 

AAG GGG AAG ATC GAC TGG ATA GGC 
Lys Gly Lys Ile Asp Trp Ile Gly

GAG ATG AGG AAC GGC CGC TTT ATG 
Glu Met Arg Asn Gly Arg Phe Met 

GAG AGG AGT GGT TAC GCA AAA TCC 
Glu Arg Ser Gly Tyr Ala Lys Ser 

GAG ACC TAT CCT GAG GGC CTC GAA 
Glu Thr Tyr Pro Glu Gly Leu Glu 

TAC GGC CTG CCA ATG ATG GTG ACG 
Asn Val Leu Met Asp Leu Lys Glu

GAG AAC GGG ATG GCG GAT ATG GCA 
Glu Asn Gly Met Ala Asp Met Ala 

AGC CAC CTC GCG GCT ATC CAC AGG 
Ser His Leu Ala Ala Ile His Arg

GGG TAC CTC CAC TGG TCT CTG ACC 
Gly Tyr Leu His Trp Ser Leu Thr 

AGA ATG CGC TTT GGG CTG GTG ATG 
Arg Met Arg Phe Gly Leu Val Met 

ATA AGG CCG AGC GCA CTC GTC TTC 
Ile Arg Pro Ser Ala Leu Val Phe

CCC GAA GAG CTC TCC CAC CTA GCG 
Pro Glu Glu Leu Ser His Leu Ala 



Leu Tyr Gly Leu Pro Met Met Val Thr 

GAC AGG CAC CGC TCT TAC TAC CTC GTG 
Asp Arg His Arg Ser Tyr Tyr Leu Val 

GCG ATG GAG AAG GGT GCC GAC GTT AGG 
Ala Met Glu Lys Gly Ala Asp Val Arg 

GAC AAC TAC GAG TGG GCG CAG GGC TTC 
Asp Asn Tyr Glu Trp Ala Gln Gly Phe

GTG GAC TTC GAG ACT AAG AAG CGC TAC 
Val Asp Phe Glu Thr Lys Lys Arg Tyr 

AGG GAG ATA GCC ACG CAG AAG GAA ATA 
Arg Glu Ile Ala Thr Gln Lys Glu Ile

1478 

AAC CTC GAA CTG GTA ACG AAG AAG TAA 
Asn Leu Glu Leu Val Thr Lys Lys END 
AEPII 1a (Clone # 63GA4) Glycosidase

1 

ATG AAG TTC CCA TCT AAC TTT CTT TTT GGC TAC TCC TGG TCG GGC TTC CAG 
Met Lys Phe Pro Ser Asn Phe Leu Phe Gly Tyr Ser Trp Ser Gly Phe Gln



TTT GAA ATG GGT TTA CCT GGG AGT GAA GTT GAG AGC GAC TGG TGG GCA TGG 
Phe Glu Met Gly Leu Pro Gly Ser Glu Val Glu Ser Asp Trp Trp Ala Trp 

GTC CAC GAT AAG GAG AAC ATC TTC TCG GGC CTA GTT AGC GGT GAC CTA CCA 
Val His Asp Lys Glu Asn Ile Phe Ser Gly Leu Val Ser Gly Asp Leu Pro

GAG AAC GGG CCT GCT TAC TGG CAC CTC TAC AAG AAA GAC CAC GAC ATA GCT 
Glu Asn Gly Pro Ala Tyr Trp His Leu Tyr Lys Lys Asp His Asp Ile Ala

GAA AGC CTT GGC ATG GAC GCG ATA AGA GGC GGA ATC GAG TGG GCG AGG ATC 
Glu Ser Leu Gly Met Asp Ala Ile Arg Gly Gly Ile Glu Trp Ala Arg Ile

TTC CCA AAA CCC ACC TTT GAC GTG AAG GTT GAC GTG GAA AAG GAC GAA AAC 
Phe Pro Lys Pro Thr Phe Asp Val Lys Val Asp Val Glu Lys Asp Glu Asn

GGG AAC ATA ATC TCC ATT GAC GTC CCG GAG AGC GCG ATA GAG GAG CTA GAA 
Gly Asn Ile Ile Ser Ile Asp Val Pro Glu Ser Ala Ile Glu Glu Leu Glu
AAG CTT GCC AAC ATG
Lys Leu Ala Asn Met 

TGG AAG GAG AGG GGC 
Trp Lys Glu Arg Gly 

CCC CTC TGG CTC CAC 
Pro Leu Trp Leu His 

GCT CCC TCG GGC TGG 
Ala Pro Ser Gly Trp 

GCT GCA TTC ATC GCC 
Ala Ala Phe Ile Ala

ATG AAC GAG CCG AAT 
Met Asn Glu Pro Asn 

GGC TTT CCA CCG GGT 
Gly Phe Pro Pro Gly 

AAC CTC ATG CAG GCT 
Asn Leu Met Gln Ala
GAT GCC CTC AAC CAC TAC
Asp Ala Leu Asn His Tyr 

AAG ACC TTC ATA TTG AAC 
Lys Thr Phe Ile Leu Asn

GAC CCG ATA GGC GTT AGA 
Asp Pro Ile Gly Val Arg

CTG GAC GAG AGG AGC GTG 
Leu Asp Glu Arg Ser Val 

TAC CAC TTG GAT GAC CTC 
Tyr His Leu Asp Asp Leu 

GTG GTT TAC GAG CAG GGT 
Val Val Tyr Glu Gln Gly

TAT CTC AGC CAC GAG GCC 
Tyr Leu Ser His Glu Ala 

CAC GCT AGA GCT TAC GAT 
His Ala Arg Ala Tyr Asp 
CGC GAA ATC TAC TCG GAC
Arg Glu Ile Tyr Ser Asp

CTC TAT CAC TGG CCC CTT 
Leu Tyr His Trp Pro Leu 

AAG CTC GGC CCT GAT AGA 
Lys Leu Gly Pro Asp Arg 

GTG GAG TTC ACC AAG TTC 
Val Glu Phe Thr Lys Phe 

GTT GAC ATG TGG AGC ACG 
Val Asp Met Trp Ser Thr 

TAC ACG AGG CCT CAG TCG 
Tyr Thr Arg Pro Gln Ser

GCT GGA AAG GCG AAG CTC 
Ala Gly Lys Ala Lys Leu 

GCG ATA AAA GAG CAC TCG 
Ala Ile Lys Glu His Ser
GAC AAG CCA GTT GGA GTT ATC TAC
Asp Lys Pro Val Gly Val Ile Tyr

GAA GCT GCA GAG GAA TCC GTT CTG 
Glu Ala Ala Glu Glu Ser Val Leu 

GTT GAT GGT CTC TAC TCA GGC AAG 
Val Asp Gly Leu Tyr Ser Gly Lys 

TTC AAA GGC AGG GTC GAC TGG GTT 
Phe Lys Gly Arg Val Asp Trp Val 

TTT GGA AAG GCC GGA GAT TCA GTG 
Phe Gly Lys Ala Gly Asp Ser Val 

TCC CCG AGG GGT GGC TAC GCC AAA 
Ser Pro Arg Gly Gly Tyr Ala Lys 

TGG GAG ATT TAT CCT GAG GGC CTC 
Trp Glu Ile Tyr Pro Glu Gly Leu



GCA TAT AAG TGG ATT GAT GCG GAG GAT 
Ala Tyr Lys Trp Ile Asp Ala Glu Asp

GAA CTC CGC AGG AGG GAT TAC GAC TTC 
Glu Leu Arg Arg Arg Asp Tyr Asp Phe 

TCC CTG ACT GCA GGT GAG AGG GAG GAC 
Ser Leu Thr Ala Gly Glu Arg Glu Asp 

GGC GTC AAC TAC TAC TCC CGC CTG CTC 
Gly Val Asn Tyr Tyr Ser Arg Leu Leu 

AGA TTA CTT GAG GGC TAC GGT TTT GTC 
Arg Leu Leu Glu Gly Tyr Gly Phe Val 

TCG GGA AGG CCT GCG AGC GAT TTT GGC 
Ser Gly Arg Pro Ala Ser Asp Phe Gly 

GAA AAG CTC CTG GTT GAG CTG AGT GGC 
Glu Lys Leu Leu Val Glu Leu Ser Gly 



AGG TAC GAG CTT CCG CTC TTC ATA ACG GAG AAT GGT ATG GCT GAT GCT GTC 
Arg Tyr Glu Leu Pro Leu Phe Ile

GAT AGG TAC AGG CCT TAC TAC CTC 
Asp Arg Tyr Arg Pro Tyr Tyr Leu 

GCG ATG GAG AAG GGT GCC GAC ATT 
Ala Met Glu Lys Gly Ala Asp Ile

GAC AAC TAC GAG TGG GCG CAG GGC 
Asp Asn Tyr Glu Trp Ala Gln Gly

GTG GAC TTC GAG ACT AAG AAG CGC 
Val Asp Phe Glu Thr Lys Lys Arg 

AGG GAA ATA GCC ACG CGG AAG GAA 
Arg Glu Ile Ala Thr Arg Lys Glu

GAT GTG GAT GCA ATC ATT GCT CGG 
Asp Val Asp Ala Ile Ile Ala Arg
Thr Glu Asn Gly Met Ala Asp Ala Val

GTG AGC CAC CTC GCG GCT ATC CAC AGG 
Val Ser His Leu Ala Ala Ile His Arg

AGG GGG TAC CTC CAC TGG TCT CTG ACC 
Arg Gly Tyr Leu His Trp Ser Leu Thr 

TTC AGA ATG CGC TTT GGG CTG GTG ATG 
Phe Arg Met Arg Phe Gly Leu Val Met 

TAC TTG AGG CCG AGC GCA CTC GTC TTC 
Tyr Leu Arg Pro Ser Ala Leu Val Phe 

ATA CCC GAA GAG CTT GAA CAC CTT GCC 
Ile Pro Glu Glu Leu Glu His Leu Ala

TGA 1454 
END 
AEPII 1a (Clone # 63GA9) Glycosidase

1 

ATG CTA CCA GAA GAG TTC CTA TGG GGC GTT GGG CAG TCA GGC TTT CAG TTC 
Met Leu Pro Glu Glu Phe Leu Trp Gly Val Gly Gln Ser Gly Phe Gln Phe

GAA ATG GGC GAC AAG CTC AGG AGG CAC ATC GAT CCA AAT ACC GAC TGG TGG 
Glu Met Gly Asp Lys Leu Arg Arg His Ile Asp Pro Asn Thr Asp Trp Trp

AAG TGG GTT CGC GAT CCT TTC AAC ATA AAA AAG GAG CTT GTG AGT GGG GAC 
Lys Trp Val Arg Asp Pro Phe Asn Ile Lys Lys Glu Leu Val Ser Gly Asp

CTT CCC GAG GAC GGC ATC AAC AAC TAC GAA CTT TTT GAA AAC GAT CAC AAG 
Leu Pro Glu Asp Gly Ile Asn Asn Tyr Glu Leu Phe Glu Asn Asp His Lys

CTC GCT AAA GGC CTT GGA CTC AAC GCA TAC AGG ATT GGA ATA GAG TGG AGC 
Leu Ala Lys Gly Leu Gly Leu Asn Ala Tyr Arg Ile Gly Ile Glu Trp Ser

AGA ATC TTT CCC TGG CCG ACG TGG ACG GTC GAT ACC GAG GTC GAG TTC GAC 
Arg Ile Phe Pro Trp Pro Thr Trp Thr Val Asp Thr Glu Val Glu Phe Asp

ACT TAC GGT TTA GTA AAG GAC GTT AAG ATA GAC AAG TCC ACC CTT GCT GAA 
Thr Tyr Gly Leu Val Lys Asp Val Lys Ile Asp Lys Ser Thr Leu Ala Glu
CTC GAC AGG CTG GCC AAC AAG GAG
Leu Asp Arg Leu Ala Asn Lys Glu 

CAG CAT TTG AGG GAG CTC GGC TTC 
Gln His Leu Arg Glu Leu Gly Phe

ACG CTT CCA ATA TGG CTC CAC GAC 
Thr Leu Pro Ile Trp Leu His Asp

ACA AAC GAC AGA ATC GGC TGG GTC 
Thr Asn Asp Arg Ile Gly Trp Val

AAG TAT GCT GCT TAC ATC GCC CAT 
Lys Tyr Ala Ala Tyr Ile Ala His

AGC ACC TTC AAC GAA CCT ATG GTA 
Ser Thr Phe Asn Glu Pro Met Val 

TAC TCA GGA TTT CCC CCG GGA GTC
Tyr Ser Gly Phe Pro Pro Gly Val

ATC CTC AAC ATG ATA AAC GCC CAC 
Ile Leu Asn Met Ile Asn Ala His
GAG GTA ATG TAC TAC AGG CGC GTT ATT
Glu Val Met Tyr Tyr Arg Arg Val Ile

AAG GTC TTC GTT AAC CTC AAC CAC TTC 
Lys Val Phe Val Asn Leu Asn His Phe 

CCG ATA GTG GCA AGG GAG AAG GCC CTC 
Pro Ile Val Ala Arg Glu Lys Ala Leu

TCC CAG AGG ACA GTT GTT GAG TTT GCC 
Ser Gln Arg Thr Val Val Glu Phe Ala

GCG CTC GGA GAC CTC GTG GAC ACA TGG 
Ala Leu Gly Asp Leu Val Asp Thr Trp 

GTT GTG GAG CTC GGC TAC CTC GCC CCC 
Val Val Glu Leu Gly Tyr Leu Ala Pro 

ATG AAC CCC GAG GCC GCG AAG CTG GCG 
Met Asn Pro Glu Ala Ala Lys Leu Ala 

GCC TTG GCA TAT AAG ATG ATA AAG AGG 
Ala Leu Ala Tyr Lys Met Ile Lys Arg



TTC GAC ACC AAG AAG GCC GAT GAG 
Phe Asp Thr Lys Lys Ala Asp Glu 

ATA ATC TAC AAC AAC ATC GGT GTT 
Ile Ile Tyr Asn Asn Ile Gly Val

AAG GAC GTT AAA GCA GCC GAA AAC 
Lys Asp Val Lys Ala Ala Glu Asn 

TTT GAT GCC ATC CAC AAG GGT AAG 
Phe Asp Ala Ile His Lys Gly Lys

TTT GTA AAA GTT AGA CAC CTA AAA 
Phe Val Lys Val Arg His Leu Lys

TAC ACC CGC GAG GTT GTT AGA TAT 
Tyr Thr Arg Glu Val Val Arg Tyr

CTC ATA TCC TTC AAG GGC GTT CCC 
Leu Ile Ser Phe Lys Gly Val Pro

ACG ACC TCC GCC GAT GGC ATG CCC 
GAT AGC AAG TCC CCT GCG GAC GTT GGC
Asp Ser Lys Ser Pro Ala Asp Val Gly 

GCC TAC CCT AAA GAC CCT AAC GAT CCC 
Ala Tyr Pro Lys Asp Pro Asn Asp Pro 

GAC AAC TAC TTC CAC AGC GGA CTG TTC 
Asp Asn Tyr Phe His Ser Gly Leu Phe 

CTC AAC ATA GAG TTC GAC GGC GAA AAC 
Leu Asn Ile Glu Phe Asp Gly Glu Asn

GGC AAT GAC TGG ATA GGC CTC AAC TAC 
Gly Asn Asp Trp Ile Gly Leu Asn Tyr

TCG GAG CCC AAG TTC CCA AGT ATA CCC 
Ser Glu Pro Lys Phe Pro Ser Ile Pro

AAC TAC GGC TAC TCC TGC AGG CCC GGC 
Asn Tyr Gly Tyr Ser Cys Arg Pro Gly 

GTC AGC GAT ATC GGC TGG GAA GTC TAT 
Thr Thr Ser Ala Asp

CCC CAG GGA ATC TAC 
Pro Gln Gly Ile Tyr

GTT TAC GTC ACC GAG 
Val Tyr Val Thr Glu 

TAC TAC ATA GTC AGC 
Tyr Tyr Ile Val Ser

TAC CCC GTA AAA GGC 
Tyr Pro Val Lys Gly 

GCC CTC GGC TTC AGC 
Ala Leu Gly Phe Ser 

AAG GAG AGG ATC CCG 
Lys Glu Arg Ile Pro

CAG TCC AAC GGT GTT 
Gln Ser Asn Gly Val
Gly Met Pro Val Ser Asp

GAC TCG ATA GTC GAG GCC 
Asp Ser Ile Val Glu Ala

AAC GGT GTT GCG GAT TCC 
Asn Gly Val Ala Asp Ser 

CAC GTC TCA AAG ATA GAG 
His Val Ser Lys Ile Glu

TAC ATG TAC TGG GCG CTT 
Tyr Met Tyr Trp Ala Leu 

ATG AGG TTT GGT CTC TAC 
Met Arg Phe Gly Leu Tyr 

AGG GAG AGA AGC GTT GAG 
Arg Glu Arg Ser Val Glu 

CCT AAG GAT ATC AAA GAG 
Pro Lys Asp Ile Lys Glu
Ile Gly Trp Glu Val Tyr

ACC AAG TAC AGT GTT CCT 
Thr Lys Tyr Ser Val Pro 

GCG GAC ACG CTG AGG CCA 
Ala Asp Thr Leu Arg Pro 

GAA GCC ATT GAG AAT GGA 
Glu Ala Ile Glu Asn Gly

ACG GAT AAC TAC GAG TGG 
Thr Asp Asn Tyr Glu Trp 

AAG GTC GAC CTC ATC TCC 
Lys Val Asp Leu Ile Ser

ATA TAT CGC AGG ATA GTG 
Ile Tyr Arg Arg Ile Val

GAG TTC CTG AAG GGT GAG 
Glu Phe Leu Lys Gly Glu 
GAG AAA TGA 1538
Glu Lys END
AEPII 1a (Clone # 63GB1) Glycosidase

1 

ATG CTA CCA GAA GAG TTC CTA TGG GGC GTT GGG CAG TCA GGC TTT CAG TTC 
Met Leu Pro Glu Glu Phe Leu Trp Gly Val Gly Gln Ser Gly Phe Gln Phe

GAA ATG GGC GAC AAG CTC AGG AGG CAC ATC GAT CCA AAT ACC GAC TGG TGG 
Glu Met Gly Asp Lys Leu Arg Arg His Ile Asp Pro Asn Thr Asp Trp Trp

AAG TGG GTT CGC GAT CCT TTC AAC ATA AAA AAG GAG CTT GTG AGT GGG GAC 
Lys Trp Val Arg Asp Pro Phe Asn Ile Lys Lys Glu Leu Val Ser Gly Asp

CTT CCC GAG GAC GGC ATC AAC AAC TAC GAA CTT TTT GAA AAC GAT CAC AAG 
Leu Pro Glu Asp Gly Ile Asn Asn Tyr Glu Leu Phe Glu Asn Asp His Lys

CTC GCT AAA GGC CTT GGA CTC AAC GCA TAC GGG ATT GGA ATA GAG TGG AGC 
Leu Ala Lys Gly Leu Gly Leu Asn Ala Tyr Gly Ile Gly Ile Glu Trp Ser

AGA ATC TTT CCC TGG CCG ACG TGG ACG GTC GAT ACC GAG GTC GAG TTC GAC 
Arg Ile Phe Pro Trp Pro Thr Trp Thr Val Asp Thr Glu Val Glu Phe Asp



ACT TAC GGT TTA GTA AAG GAC GTT AAG ATA GAC AAG TCC ACC CTT GCT GAA
Thr Tyr Gly Leu Val Lys Asp Val Lys Ile Asp Lys Ser Thr Leu Ala Glu
CTC GAC AGG CTG GCC AAC AAG GAG
Leu Asp Arg Leu Ala Asn Lys Glu 

CAG CAT TTG AGG GAG CTC GGC TTC 
Gln His Leu Arg Glu Leu Gly Phe

ACG CTT CCA ATA TGG CTC CAC GAC 
Thr Leu Pro Ile Trp Leu His Asp

ACA AAC GAC AGA ATC GGC TGG GTC 
Thr Asn Asp Arg Ile Gly Trp Val

AAG TAT GCT GCT TAC ATC GCC CAT 
Lys Tyr Ala Ala Tyr Ile Ala His

AGC ACC TTC AAC GAA CCT ATG GTA 
Ser Thr Phe Asn Glu Pro Met Val 

TAC TCA GGA TTT CCC CCG GGA GTC 
Tyr Ser Gly Phe Pro Pro Gly Val 

ATC CTC AAC ATG ATA AAC GCC CAC 
Ile Leu Asn Met Ile Asn Ala His
GAG GTA ATG TAC TAC AGG CGC GTT ATT
Glu Val Met Tyr Tyr Arg Arg Val Ile

AAG GTC TTC GTT AAC CTC AAC CAC TTC 
Lys Val Phe Val Asn Leu Asn His Phe 

CCG ATA GTG GCA AGG GAG AAG GCC CTC 
Pro Ile Val Ala Arg Glu Lys Ala Leu

TCC CAG AGG ACA GTT GTT GAG TTT GCC 
Ser Gln Arg Thr Val Val Glu Phe Ala

GCG CTC GGA GAC CTC GTG GAC ACA TGG 
Ala Leu Gly Asp Leu Val Asp Thr Trp 

GTT GTG GAG CTC GGA TAC CTC GCC CCC 
Val Val Glu Leu Gly Tyr Leu Ala Pro 

ATG AAC CCC GAG GCC GCG AAG CTG GCG 
Met Asn Pro Glu Ala Ala Lys Leu Ala 

GCC TTG GCA TAT AAG ATG ATA AAG AGG 
Ala Leu Ala Tyr Lys Met Ile Lys Arg
TTC GAC ACC AAG AAG GCC GAT GAG
Phe Asp Thr Lys Lys Ala Asp Glu 

ATA ATC TAC AAC AAC ATC GGT GTT 
Ile Ile Tyr Asn Asn Ile Gly Val

AAG GAC GTT AAA GCA GCC GAA AAC 
Lys Asp Val Lys Ala Ala Glu Asn 

TTT GAT GCC ATC CAC AAG GGT AAG 
Phe Asp Ala Ile His Lys Gly Lys

TTT GTA AAA GTT AGA CAC CTA AAA 
Phe Val Lys Val Arg His Leu Lys 

TAC ACC CGC GAG GTT GTT AGA TAT 
Tyr Thr Arg Glu Val Val Arg Tyr 

CTC ATA TCC TTC AAG GGC GTT CCC 
Leu Ile Ser Phe Lys Gly Val Pro
GAT AGC AAG TCC CCT GCG GAC GTT GGC
Asp Ser Lys Ser Pro Ala Asp Val Gly 

GCC TAC CCT AAA GAC CCT AAC GAT CCC 
Ala Tyr Pro Lys Asp Pro Asn Asp Pro 

GAC AAC TAC TTC CAC AGC GGA CTG TTC 
Asp Asn Tyr Phe His Ser Gly Leu Phe 

CTC AAC ATA GAG TTC GAC GGC GAA AAC 
Leu Asn Ile Glu Phe Asp Gly Glu Asn

GGC AAT GAC TGG ATA GGC CTC AAC TAC 
Gly Asn Asp Trp Ile Gly Leu Asn Tyr

TCG GAG CCC AAG TTC CCA AGT ATA CCC 
Ser Glu Pro Lys Phe Pro Ser Ile Pro

AAC TAC GGC TAC TCC TGC AGG CCC GGC 
Asn Tyr Gly Tyr Ser Cys Arg Pro Gly 



ACG ACC TCC GCC GAT GGC ATG CCC GTC AGC GAT ATC GGC TGG GAA GTC TAT 
Thr Thr Ser Ala Asp Gly Met Pro Val Ser Asp Ile Gly Trp Glu Val Tyr

CCC CAG GGA ATC TAC GAC TCG ATA GTC GAG GCC ACC AAG TAC AGT GTT CCT 
Pro Gln Gly Ile Tyr Asp Ser Ile Val Glu Ala Thr Lys Tyr Ser Val Pro

GTT TAC GTC ACC GAG AAC GGT GTT GCG GAT TCC GCG GAC ACG CTG AGG CCA 
Val Tyr Val Thr Glu Asn Gly Val Ala Asp Ser Ala Asp Thr Leu Arg Pro 

TAC TAC ATA GTC AGC CAC GTC TCA AAG ATA GAG GAA GCC ATT GAG AAT GGA 
Tyr Tyr Ile Val Ser His Val Ser Lys Ile Glu Glu Ala Ile Glu Asn Gly

TAC CCC GTA AAA GGC TAC ATG TAC TGG GCG CTT ACG GAT AAC TAC GAG TGG 
Tyr Pro Val Lys Gly Tyr Met Tyr Trp Ala Leu Thr Asp Asn Tyr Glu Trp 

GCC CTC GGC TTC AGC ATG AGG TTT GGT CTC TAC AAG GTC GAC CTC ATC TCC 
Ala Leu Gly Phe Ser Met Arg Phe Gly Leu Tyr Lys Val Asp Leu Ile Ser

AAG GAG AGG ATC CCG AGG GAG AGA AGC GTT GAG ATA TAT CGC AGG ATA GTG 
Lys Glu Arg Ile Pro Arg Glu Arg Ser Val Glu Ile Tyr Arg Arg Ile Val

CAG TCC AAC GGT GTT CCT AAG GAT ATC AAA GAG GAG TTC CTG AAG GGT GAG 
Gln Ser Asn Gly Val Pro Lys Asp Ile Lys Glu Glu Phe Leu Lys Gly Glu
GAG AAA TGA 1536
Glu Lys END
AEPII 1a (Clone # 63GP1) Glycosidase

1 

ATG CGT CCA TTC TTG TTA ATT TCT ATT TTG GAC TTT CGA GTT GCT GAC TAC 
Met Arg Pro Phe Leu Leu Ile Ser Ile Leu Asp Phe Arg Val Ala Asp Tyr

CTC CAA CGT AAC ATA AAG ACA CAA AAC CAA TAT TGG GCA TTG TGC GTA GTA 
Leu Gln Arg Asn Ile Lys Thr Gln Asn Gln Tyr Trp Ala Leu Cys Val Val

ATG TTC TCC AAT GTT CTT AGA TGG CAA AAC TTA AAT ATT TCA CCA GCG GTG 
Met Phe Ser Asn Val Leu Arg Trp Gln Asn Leu Asn Ile Ser Pro Ala Val

ATA CAT AGA GAC ACC GCT GAA CAC AGA GGT GAT TCC ATG AAG AAG TTT GTC 
Ile His Arg Asp Thr Ala Glu His Arg Gly Asp Ser Met Lys Lys Phe Val

GCC CTG TTC ATA ACC ATG TTT TTC GTA GTG AGC ATG GCA GTC GTT GCA CAG 
Ala Leu Phe Ile Thr Met Phe Phe Val Val Ser Met Ala Val Val Ala Gln

CCA GCT AGC GCC GCA AAG TAT TCC GAG CTC GAA GAA GGC GGC GTT ATA ATG 
Pro Ala Ser Ala Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val Ile Met

CAG GCC TTC TAC TGG GAC GTC CCA GGT GGA GGA ATC TGG TGG GAC ACC ATC 
Gln Ala Phe Tyr Trp Asp Val Pro Gly Gly Gly Ile Trp Trp Asp Thr Ile
AGG AGC AAG ATA CCG GAG TGG TAC
Arg Ser Lys Ile Pro Glu Trp Tyr

CCG CCA GCC AGC AAG GGG ATG AGC 
Pro Pro Ala Ser Lys Gly Met Ser 

TAC GAT TTC TTT GAC CTC GGC GAG 
Tyr Asp Phe Phe Asp Leu Gly Glu 

CGC TTT GGC TCT AAA CAG GAG CTC 
Arg Phe Gly Ser Lys Gln Glu Leu

TAC GGC ATA AAG GTC ATA GCG GAC
Tyr Gly Ile Lys Val Ile Ala Asp

GAC CTC GAG TGG AAC CCG TTC GTT 
Asp Leu Glu Trp Asn Pro Phe Val 

AAG GTG GCC TCG GGC AAA TAT ACT 
Lys Val Ala Ser Gly Lys Tyr Thr 

GAG GTC AAG TGC TGT GAC GAG GGC 
Glu Val Lys Cys Cys Asp Glu Gly 
GAG GCG GGA ATA TCC GCC ATT TGG ATT
Glu Ala Gly Ile Ser Ala Ile Trp Ile

GGC GGT TAC TCG ATG GGC TAC GAT CCC 
Gly Gly Tyr Ser Met Gly Tyr Asp Pro 

TAC AAC CAG AAG GGA ACC ATC GAA ACG 
Tyr Asn Gln Lys Gly Thr Ile Glu Thr

ATC AAT ATG ATA AAC ACG GCC CAT GCC 
Ile Asn Met Ile Asn Thr Ala His Ala

ATC GTC ATA AAC CAC CGC GCA GGC GGA 
Ile Val Ile Asn His Arg Ala Gly Gly

GGG GAC TAC ACC TGG ACG GAC TTC TCA 
Gly Asp Tyr Thr Trp Thr Asp Phe Ser 

GCC AAC TAC CTC GAC TTC CAC CCC AAC 
Ala Asn Tyr Leu Asp Phe His Pro Asn 

ACA TTT GGA GGC TTC CCA GAC ATA GCC 
Thr Phe Gly Gly Phe Pro Asp Ile Ala
CAC GAG AAG
His Glu Lys 

GCC GCC TAC 
Ala Ala Tyr 

AAG GGC TAC 
Lys Gly Tyr 

TGG GCC GTT 
Trp Ala Val 

GCC TAC TCG 
Ala Tyr Ser 



AGC TGG GAC 
Ser Trp Asp 

CTA AGG AGC 
Leu Arg Ser 

GGA GCG TGG 
Gly Ala Trp 

GGC GAG TAC 
Gly Glu Tyr 

AGC GGC GCC 
Ser Gly Ala 
CAG CAC TGG
Gln His Trp

ATC GGC GTT
Ile Gly Val

GTC GTC AAG 
Val Val Lys 

TGG GAC ACC 
Trp Asp Thr 

AAG GTC TTC
Lys Val Phe



GAT GAG GCC TTT GAC AAC AAA AAC ATT 
Asp Glu Ala Phe Asp Asn Lys Asn Ile



CTC TGG GCG AGC GAT GAG AGC TAC 
Leu Trp Ala Ser Asp Glu Ser Tyr 

GAT GCC TGG CGC TTT GAC TAC GTG 
Asp Ala Trp Arg Phe Asp Tyr Val 

GAC TGG CTC AAC TGG TGG GGC GGC 
Asp Trp Leu Asn Trp Trp Gly Gly 

AAC GTT GAT GCA CTC CTC AAC TGG 
Asn Val Asp Ala Leu Leu Asn Trp 

GAC TTC CCG CTC TAC TAC AAG ATG 
Asp Phe Pro Leu Tyr Tyr Lys Met 

CCA GCG CTC GTC TCT GCC CTT CAG 
Pro Ala Leu Val Ser Ala Leu Gln



AAC GGC CAG ACT GTT GTC TCC CGC GAC CCG TTC AAG GCC GTA ACC TTT GTA 
Asn Gly Gln Thr Val Val Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val

GCA AAC CAC GAC ACC GAT ATA ATC TGG AAC AAG TAC CTT GCT TAT GCT TTC 
Ala Asn His Asp Thr Asp Ile Ile

ATC CTC ACC TAC GAA GGC CAG CCC 
Ile Leu Thr Tyr Glu Gly Gln Pro

TGG CTC AAC AAG GAC AGG TTG AAC 
Trp Leu Asn Lys Asp Arg Leu Asn 

GCA GGT GGA AGC ACG AGC ATA GTT 
Ala Gly Gly Ser Thr Ser Ile Val

GTG AGG AAC GGC TAT GGA AGC AAG 
Val Arg Asn Gly Tyr Gly Ser Lys 

GGC TCG AGC AAG GTT GGA AGG TGG 
Gly Ser Ser Lys Val Gly Arg Trp 

TGC ATC CAC GAG TAT ACT GGT AAC 
Cys Ile His Glu Tyr Thr Gly Asn

TAC TCA AGC GGC TGG GTC TAT TTC 
Tyr Ser Ser Gly Trp Val Tyr Phe 
Trp Asn Lys Tyr Leu Ala Tyr Ala Phe

GTC ATA TTT TAC CGC GAC TAC GAG GAG 
Val Ile Phe Tyr Arg Asp Tyr Glu Glu

AAC CTC ATA TGG ATA CAC GAC CAC CTC 
Asn Leu Ile Trp Ile His Asp His Leu

TAC TAC GAC AGC GAC GAG ATG ATT TTC 
Tyr Tyr Asp Ser Asp Glu Met Ile Phe

CCT GGC CTT ATA ACT TAC ATC AAC CTC 
Pro Gly Leu Ile Thr Tyr Ile Asn Leu

GTT TAT GTG CCG AAG TTC GCG GGC GCG 
Val Tyr Val Pro Lys Phe Ala Gly Ala 

CTC GGA GGC TGG GTA GAC AAG TAC GTC 
Leu Gly Gly Trp Val Asp Lys Tyr Val 

GAA GCT CCA GCT TAC GAC CCT GCC AAC 
Glu Ala Pro Ala Tyr Asp Pro Ala Asn 
GGG CAG TAT GGC TAC TCC GTG TGG AGC TAT TGC GGT GTT GGG TGA 1574
Gly Gln Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly END
AEPII 1a (Clone # 63GP2) Glycosidase

1 

ATG ATA AAC GTT GCA ACG GGA GAG GAG ACC CCA ATA CAC CTC TTT GGA GTC 
Met Ile Asn Val Ala Thr Gly Glu Glu Thr Pro Ile His Leu Phe Gly Val

AAC TGG TTC GGC TTT GAG ACA CCG AAC TAC GTT GTT CAC GGC CTA TGG AGT 
Asn Trp Phe Gly Phe Glu Thr Pro Asn Tyr Val Val His Gly Leu Trp Ser 

AGG AAC TGG GAG GAC ATG CTC CTC CAG ATC AAG AGC CTT GGC TTC AAT GCG 
Arg Asn Trp Glu Asp Met Leu Leu Gln Ile Lys Ser Leu Gly Phe Asn Ala

ATA AGG CTT CCC TTC TGT ACC CAG TCA GTA AAA CCG GGG ACG ATG CCA ACG 
Ile Arg Leu Pro Phe Cys Thr Gln Ser Val Lys Pro Gly Thr Met Pro Thr

GCG ATT GAC TAC GCC AAG AAC CCA GAC CTC CAG GGT CTT GAC AGC GTC CAG 
Ala Ile Asp Tyr Ala Lys Asn Pro Asp Leu Gln Gly Leu Asp Ser Val Gln

ATA ATG GAG AAA ATA ATC AAG AAG GCT GGA GAC CTG GGC ATA TTC GTG CTC 
Ile Met Glu Lys Ile Ile Lys Lys Ala Gly Asp Leu Gly Ile Phe Val Leu



CTC GAC TAC CAC AGA ATA GGA TGC AAC TTC ATA GAA CCC CTA TGG TAC ACC 
Leu Asp Tyr His Arg Ile Gly Cys Asn Phe Ile Glu Pro Leu Trp Tyr Thr
GAC AGC TTC TCG GAG CAG GAC TAC
Asp Ser Phe Ser Glu Gln Asp Tyr

AGG TTC GGC AAG TAC TGG AAC GTT 
Arg Phe Gly Lys Tyr Trp Asn Val 

CAC AGC TCA AGC CCC GCA CCT GCC 
His Ser Ser Ser Pro Ala Pro Ala 

TGG GGA ATG GGC AAC AAC GCC ACC 
Trp Gly Met Gly Asn Asn Ala Thr 

GGA AGG GCA ATT CTG GAG GTT GCC 
Gly Arg Ala Ile Leu Glu Val Ala

ACC CAG TTC ACC ACC CCC GAG ATA 
Thr Gln Phe Thr Thr Pro Glu Ile

GCC TGG TGG GGC GGA AAC CTT ATG 
Ala Trp Trp Gly Gly Asn Leu Met 

CCC AGG GAC AAG CTT GTT TAC AGC 
Pro Arg Asp Lys Leu Val Tyr Ser 
ATA AAC ACC TGG GTT GAA GTC GCC CAG
Ile Asn Thr Trp Val Glu Val Ala Gln

ATC GGC GCG GAC CTG AAG AAC GAA CCC 
Ile Gly Ala Asp Leu Lys Asn Glu Pro

GCC TAC ACT GAC GGA AGT GGG GCC ACG 
Ala Tyr Thr Asp Gly Ser Gly Ala Thr 

GAC TGG AAC CTG GCG GCT GAG AGG ATA 
Asp Trp Asn Leu Ala Ala Glu Arg Ile

CCA CAA TGG GTT ATA TTT GTT GAG GGA 
Pro Gln Trp Val Ile Phe Val Glu Gly

GAC GGT AGG TAC AAG TGG GGC CAC AAC 
Asp Gly Arg Tyr Lys Trp Gly His Asn 

GGT GTT AGG AAG TAC CCA GTT AAC CTG 
Gly Val Arg Lys Tyr Pro Val Asn Leu 

CCC CAA GTT TAC GGT CCA GAC GTT TAC 
Pro Gln Val Tyr Gly Pro Asp Val Tyr
GAC CAG CCC TAC TTT GAC CCC GGT
Asp Gln Pro Tyr Phe Asp Pro Gly

ATA TGG TAC CAC CAC TTC GGC TAC 
Ile Trp Tyr His His Phe Gly Tyr

GTT ATA GGT GAG TTC GGA GGC AAG 
Val Ile Gly Glu Phe Gly Gly Lys

GTC ACT TGG CAG AAC AAG ATA ATA 
Val Thr Trp Gln Asn Lys Ile Ile

GAC TTC TTC TAC TGG AGC TGG AAC 
Asp Phe Phe Tyr Trp Ser Trp Asn 

CTG AAG GAT GAC TGG ACG ACA ATA 
Leu Lys Asp Asp Trp Thr Thr Ile

AGG CTC ATG GAC AGC TGT TCT GGA 
Arg Leu Met Asp Ser Cys Ser Gly 

ACA ACT ACA ACA ACA AGC ACA CCG 
GAG GGG TTC CCC GAC AAC CTC CCC GAA
Glu Gly Phe Pro Asp Asn Leu Pro Glu 

GTA AAG CTT GAT CTC GGT TAC CCT GTT 
Val Lys Leu Asp Leu Gly Tyr Pro Val 

TAC GGC CAT GGG GGA GAC CCG AGG GAT 
Tyr Gly His Gly Gly Asp Pro Arg Asp 

GAC TGG ATG ATC CAG AAC AAA TTC TGT 
Asp Trp Met Ile Gln Asn Lys Phe Cys

CCA AAC AGC GGT GAC ACC GGT GGA ATT 
Pro Asn Ser Gly Asp Thr Gly Gly Ile

TGG GAG GAC AAG TAC AAC AAC CTG AAG 
Trp Glu Asp Lys Tyr Asn Asn Leu Lys 

AAC GCC ACT GCC CCG TCC GTC CCC ACG 
Asn Ala Thr Ala Pro Ser Val Pro Thr 

CCA ACG ACC ACA ACG ACT ACA ACA TCC 
Thr Thr Thr Thr Thr Ser Thr Pro Pro Thr Thr Thr Thr Thr Thr Thr Ser

ACT CCA ACG ACC ACT ACC CAG ACC CCG ACC ACC ACT ACT CCA ACT ACG ACA 
Thr Pro Thr Thr Thr Thr Gln Thr Pro Thr Thr Thr Thr Pro Thr Thr Thr

ACC ACC ACG ACC ACA ACT CCT TCA AAT AAC GTC CCA TTT GAA ATT GTG AAC 
Thr Thr Thr Thr Thr Thr Pro Ser Asn Asn Val Pro Phe Glu Ile Val Asn

GTT CTC CCG ACT AGC TCC CAG TAC GAG GGA ACC AGC GTG GAG GTT GTA TGT 
Val Leu Pro Thr Ser Ser Gln Tyr Glu Gly Thr Ser Val Glu Val Val Cys

GAT GGA ACC CAG TGT GCC TCC AGC GTT TGG GGA GCT CCG AAC CTC TGG GGA 
Asp Gly Thr Gln Cys Ala Ser Ser Val Trp Gly Ala Pro Asn Leu Trp Gly

GTC GTT AAA ATC GGA AAC GCC ACC ATG GAC CCC AAC GTT TGG GGC TGG GAG 
Val Val Lys Ile Gly Asn Ala Thr Met Asp Pro Asn Val Trp Gly Trp Glu

GAC GTT TAC AAG ACT GCA CCC CAG GAC ATT GGA ACC GGC AGC ACA AAG ATG 
Asp Val Tyr Lys Thr Ala Pro Gln Asp Ile Gly Thr Gly Ser Thr Lys Met

GAG ATA AGG AAC GGG GTG CTC AAG GTT ACA AAC CTC TGG AAC ATC AAC ATG 
Glu Ile Arg Asn Gly Val Leu Lys Val Thr Asn Leu Trp Asn Ile Asn Met
AEPII 1a (Clone # 63GP4) Glycosidase

1 

GCT GGA GTG GGT GAG CAA CGG GAT AAC CTA CCA GAT ATT CCC CGA CAG GTT 
Ala Gly Val Gly Glu Gln Arg Asp Asn Leu Pro Asp Ile Pro Arg Gln Val

CAA CAA CGG AAA CAG GAG CAA CGA TGC CCT AGC TTT GGA CCA CGA CGA GCT 
Gln Gln Arg Lys Gln Glu Gln Arg Cys Pro Ser Phe Gly Pro Arg Arg Ala

AAT TCT GAA CCA GGT CAA TCC AGG CAA ACC AAT CCT CTC CAA CTG GAG CGA 
Asn Ser Glu Pro Gly Gln Ser Arg Gln Thr Asn Pro Leu Gln Leu Glu Arg

CCC TAT AAC GCC CCT CCA CTG CTG CCA CCA GTA CTT CGG CGG CGA CAT AAA 
Pro Tyr Asn Ala Pro Pro Leu Leu Pro Pro Val Leu Arg Arg Arg His Lys 

GGG AAT AAC GGA GAA GCT CGA CTA CCT TCA GAG CCT AGG TGT TAC TAT AAT 
Gly Asn Asn Gly Glu Ala Arg Leu Pro Ser Glu Pro Arg Cys Tyr Tyr Asn 

CTA CCT CAA CCC GAT TTT CCT CTC GGG AAG CGC CCA CGG CTA CGA CAC CTA 
Leu Pro Gln Pro Asp Phe Pro Leu Gly Lys Arg Pro Arg Leu Arg His Leu

CGA CTA CTA CCG GCT TGA CCC CAA GTT CGG GAC CGA GGA GGA GCT GAG AGA 
Arg Leu Leu Pro Ala End Pro Gln Val Arg Asp Arg Gly Gly Ala Glu Arg
CAT CCG AAG TAT AAC ACA ATG GCA
His Pro Lys Tyr Asn Thr Met Ala 

CCT TGG GGC AAC CAG CCA ATA AAC 
Pro Trp Gly Asn Gln Pro Ile Asn

GTC TCC CAG CTT CCG AGG ATA CTC 
Val Ser Gln Leu Pro Arg Ile Leu

AGC TTC CCG GGA AAC AAC TTC GCC 
Ser Phe Pro Gly Asn Asn Phe Ala 

AAC AAC ATG AGG GCA CCA GGC CAG 
Asn Asn Met Arg Ala Pro Gly Gln

ACT GAC GGG CTC CAG GAG TCG TCG 
Thr Asp Gly Leu Gln Glu Ser Ser

ATA AGC TTG CGG CCG CCA CCG CGG 
Ile Ser Leu Arg Pro Pro Pro Arg
TAC CCG GAG GTC ATA TAC GGC GCC AAG
Tyr Pro Glu Val Ile Tyr Gly Ala Lys

GCT CCG AAC TTC GTG CTC CCG ATA AAG 
Ala Pro Asn Phe Val Leu Pro Ile Lys

GTT GAC ACA AAG TAC ACG CTC GAA AAG 
Val Asp Thr Lys Tyr Thr Leu Glu Lys 

TTT GAG GCC TGG CTC TTC AAG GAT GCC 
Phe Glu Ala Trp Leu Phe Lys Asp Ala 

GGG GAC TAC GAG AGG AAT TCC GCC GAT 
Gly Asp Tyr Glu Arg Asn Ser Ala Asp 

CCA CCA ATC CCC ATA TGG AAA CCG TCG 
Pro Pro Ile Pro Ile Trp Lys Pro Ser

1886 

TGG AGC TCC AGC TTT TGT TCC CTT TAA 
Trp Ser Ser Ser Phe Cys Ser Leu END 
GTT CCT CGA TGA GGC
Val Pro Arg End Gly 

GCC CAA CCA CTG CGG 
Ala Gln Pro Leu Arg

GGG CAA CGA AAG CCC 
Gly Gln Arg Lys Pro

CAA GCT CGG CGA TGG 
Gln Ala Arg Arg Trp

TCC AAA GCT CAA CAC 
Ser Lys Ala Gln His

GGC CCT CCA CTG GAT 
Gly Pro Pro Leu Asp 

GAA CGA AGT CCT CGA 
Glu Arg Ser Pro Arg 

CAA GGA GAA AAA GCC 
Gln Gly Glu Lys Ala
ACA CAG GCG GGG AAT GAG
Thr Gln Ala Gly Asn Glu

CAT AGG GAA TCC AGC CTT 
His Arg Glu Ser Ser Leu 

ATA CTG GGA CTG GTT CTT 
Ile Leu Gly Leu Val Leu

GAA CGC CTA CGT CGG CTG 
Glu Arg Leu Arg Arg Leu 

TGC CAA CCC GGA GGT CAG 
Cys Gln Pro Gly Gly Gln

AGA GTT CGG CTT TGA CGG 
Arg Val Arg Leu End Arg 

CCC GGG AAC GTT CTT CCC 
Pro Gly Asn Val Leu Pro 

GGA CGC ATA CCT CGT CGG 
Gly Arg Ile Pro Arg Arg
GGT AAT TTT CGA TTT TGT
Gly Asn Phe Arg Phe Cys 

CCT AGA AGT TTG GAA GAA 
Pro Arg Ser Leu Glu Glu 

CGT CAA GAA GTG GCC GTT 
Arg Gln Glu Val Ala Val

GTG GGG CTT TGG GAG CCT 
Val Gly Leu Trp Glu Pro 

GGA ATA CCT GAT AGG AGC 
Gly Ile Pro Asp Arg Ser

CAT CAG GGT TGA TGT GCC 
His Gln Gly End Cys Ala

GGA GCT GAG AAA GGC AGT
Gly Ala Glu Lys Gly Ser

TGA GAT ATG GAC GCT CTC 
End Asp Met Asp Ala Leu 
CCC TGA GTG GGT GAA AGG AGA CCG CTT CGA CTC CCT CAT GAA CTA CGC CCT
Pro End Val Gly Glu Arg Arg Pro Leu Arg Leu Pro His Glu Leu Arg Pro

CGG GAG GGA CAT CCT CCT GAA CTA CGC GAA GGG CCT GCT CAG TGG AGA AAG 
Arg Glu Gly His Pro Pro Glu Leu Arg Glu Gly Pro Ala Gln Trp Arg Lys

TGC AAT GAA AAT GAT GGG ACG TTA CTA TGC TTC CTA CGG CGA GAA CGT ATT 
Cys Asn Glu Asn Asp Gly Thr Leu Leu Cys Phe Leu Arg Arg Glu Arg Ile

GCG ATG GGC TTC AAC CTC GTT GAT TCG CAC GAC ACT TCG AGG GTT CTC ACT 
Ala Met Gly Phe Asn Leu Val Asp Ser His Asp Thr Ser Arg Val Leu Thr 

GAT CTC GGT GGG GGG AGT CTC GGT GAC ACA CCG TCA AAC GAG TCA ATT CAG 
Asp Leu Gly Gly Gly Ser Leu Gly Asp Thr Pro Ser Asn Glu Ser Ile Gln

AGA CTC AAG CTC CTC TCA ACG TCC TCT ATG CCC TGC CTG GAA CTC CGG TCA 
Arg Leu Lys Leu Leu Ser Thr Ser Ser Met Pro Cys Leu Glu Leu Arg Ser 

CCT TCC AGG GGA TGA GAG AGG ACT GCT CGG AGA CAA GGG GCA CTA CGA CGA 
Pro Ser Arg Gly End Glu Arg Thr Ala Arg Arg Gln Gly Ala Leu Arg Arg



ACA GCG CTA CCC AAT ACA GTG GGA TAC TGT GAA CGA AGA CGT CCT GAA CCA 
Thr Ala Leu Pro Asn Thr

TTA CAG GGC ATT GGC GGA 
Leu Gln Gly Ile Gly Gly

CGC AAT AAG GTT CTA CAC 
Arg Asn Lys Val Leu His 

GCA TCA TGA CGA GGT TCT 
Ala Ser End Arg Gly Ser 

ACT AAA GCT TCC TGA GGG 
Thr Lys Ala Ser End Gly 

CCC GGA ACT GCT TCG CGG 
Pro Gly Thr Ala Ser Arg 
Val Gly Tyr Cys Glu

GCT CAG AAA AAG AGT 
Ala Gln Lys Lys Ser

TGC CAA AGG CGG CGT 
Cys Gln Arg Arg Arg

TGT CGT TGC CAA CAG 
Cys Arg Cys Gln Gln

AGA GTG GAA AGT AAT 
Arg Val Glu Ser Asn 

CAA AGT TGA AGT GCC 
Gln Ser End Ser Ala



Arg Arg Arg Pro Glu Pro 

TCC TGC ATT GAG GAG CAG 
Ser Cys Ile Glu Glu Gln

TAT GGC CTT CTT CAG GGG 
Tyr Gly Leu Leu Gln Gly

CTG GAA GAA GCC AGC CCT 
Leu Glu Glu Ala Ser Pro 

CTG GCC TGA GAA TTT CAG 
Leu Ala End Glu Phe Gln

AGC CAT AGG GAT AAT CAT 
Ser His Arg Asp Asn His



CCT TGA GCG GAG TTG 
Pro End Ala Glu Leu 



1443 
Bacillus thermoleovorans (Clone # 68GC1) Glycosidase

1 

ATG ACT GAA TTA TAT ATA AAA AAT CCC CTG ATC GAA CAG CGG GCA GAT CCC 
Met Thr Glu Leu Tyr Ile Lys Asn Pro Leu Ile Glu Gln Arg Ala Asp Pro

TGG ATC TAT AAA CAT ACC GAT GGT TAT TAT TAC TTT ACC GGT TCC GTG CCG 
Trp Ile Tyr Lys His Thr Asp Gly Tyr Tyr Tyr Phe Thr Gly Ser Val Pro

GAG TAC GAC CGA ATT GAG CTT AGA CGC TCG CAA ACG ATT CAA GGG CTT GCG 
Glu Tyr Asp Arg Ile Glu Leu Arg Arg Ser Gln Thr Ile Gln Gly Leu Ala

GAT GCC GAA GGA ATT ACG ATC TGG CGC AAG CAT GAG TCA GGC CTG ATG AGT 
Asp Ala Glu Gly Ile Thr Ile Trp Arg Lys His Glu Ser Gly Leu Met Ser

GCC AAC ATA TGG GCA CCC GAG ATT CAT TAT ATG GAT GGC AAA TGG TAT GTG 
Ala Asn Ile Trp Ala Pro Glu Ile His Tyr Met Asp Gly Lys Trp Tyr Val

TAT TAC GCC GCT GCC CAT ACT TCA GAA ACG AGG GAC GGA TTG TTC GAT CAC 
Tyr Tyr Ala Ala Ala His Thr Ser Glu Thr Arg Asp Gly Leu Phe Asp His 



CGC ATG TTC GTA TTG GAG AAC GCT TCG GCG AAC CCG CTC GAA GGG GAA TGG 
Arg Met Phe Val Leu Glu Asn Ala Ser Ala Asn Pro Leu Glu Gly Glu Trp 
GTG GAG AAG GGG CAA GTG ATC ACG
Val Glu Lys Gly Gln Val Ile Thr

ACG ACG TTC GAG CAT AAA GGC AAA 
Thr Thr Phe Glu His Lys Gly Lys 

CCG GGC ATT CCA GGC AAT TCC AAT 
Pro Gly Ile Pro Gly Asn Ser Asn

TGG ACC CTG ACA GGG GAA CAG GTA 
Trp Thr Leu Thr Gly Glu Gln Val

GAG AAG ATC GGG TAT CTT GTG AAT 
Glu Lys Ile Gly Tyr Leu Val Asn

GGG CGA ATA TTC ATG ACC TAT TCC 
Gly Arg Ile Phe Met Thr Tyr Ser

ATG GGG CTG CTG ACA GCC GAT GAA 
Met Gly Leu Leu Thr Ala Asp Glu 

TGG GTC AAG TCG CCT GTA CCT GTA 
Trp Val Lys Ser Pro Val Pro Val 
AAG TGG GAA TCT TTC GCC TTG GAC GCA
Lys Trp Glu Ser Phe Ala Leu Asp Ala 

CGG TAC TAT GTA TGG GCT CAG AAA GAT 
Arg Tyr Tyr Val Trp Ala Gln Lys Asp

CTG TAT ATC TCA TTG ATG GAA GAC CCG 
Leu Tyr Ile Ser Leu Met Glu Asp Pro

TGC ATA TCG GTT CCC GAG TAC GAT TGG 
Cys Ile Ser Val Pro Glu Tyr Asp Trp

GAA GGG GCC GCC GTT CTT AAG CGA AAC 
Glu Gly Ala Ala Val Leu Lys Arg Asn 

GCG AGC GCC ACG GAC CAC AAC TAT GCG 
Ala Ser Ala Thr Asp His Asn Tyr Ala 

GAC AGT GAT TTG CTG AAT CCG AGC TCC 
Asp Ser Asp Leu Leu Asn Pro Ser Ser 

TTT ACG ACA TCT GAA GCC AAT GGC CAA 
Phe Thr Thr Ser Glu Ala Asn Gly Gln
TAT GGT CCG GGG CAC AAC AGC TTC
Tyr Gly Pro Gly His Asn Ser Phe 

ATT TTG GTA TAC CAT GCA AGA AGT 
Ile Leu Val Tyr His Ala Arg Ser

ATG ATC CGA ACC GTC ATA CGC GTG 
Met Ile Arg Thr Val Ile Arg Val

GAA CGC CGA ATT TCG GGG TGC CAA 
Glu Arg Arg Ile Ser Gly Cys Gln
ACG ATT TCC GAG GAC GGC TTG CAG GAC
Thr Ile Ser Glu Asp Gly Leu Gln Asp

TAC AAG GAG ATC GTC GGG ATC CAC TAT 
Tyr Lys Glu Ile Val Gly Ile His Tyr

TAC AGG TCA TCC GAT GGA ACG AAG ACG 
Tyr Arg Ser Ser Asp Gly Thr Lys Thr 

GAG CGG ATC ATG AAC CGG TCT CCA AGC 
Glu Arg Ile Met Asn Arg Ser Pro Ser



CAT GAT GCC GAC TTT GTC ATT GGG GTT GTG ACC GGA AGG ATT AAC AAA CAT 
His Asp Ala Asp Phe Val Ile Gly Val Val Thr Gly Arg Ile Asn Lys His



CAG ACC GAC TGA 1031 
Gln Thr Asp END
Thermotoga maritima (Clone # 6GA2) Glycosidase

1 

TTG AAT AAC ACC ATT CCA AGA TGG CGT GGT TTC AAC CTT CTG GAG GCC TTT 
Leu Asn Asn Thr Ile Pro Arg Trp Arg Gly Phe Asn Leu Leu Glu Ala Phe

TCC ATT AAA AGT ACA GGA AAT TTT AAA GAG GAA GAT TTT TTG TGG ATG GCT 
Ser Ile Lys Ser Thr Gly Asn Phe Lys Glu Glu Asp Phe Leu Trp Met Ala

CAG TGG GAC TTT AAT TTT GTT AGA ATC CCT ATG TGT CAT CTT CTC TGG TCA 
Gln Trp Asp Phe Asn Phe Val Arg Ile Pro Met Cys His Leu Leu Trp Ser

GAC CGG GGC AAC CCA TTT ATT ATC AGA GAA GAT TTT TTT GAG AAA ATC GAT 
Asp Arg Gly Asn Pro Phe Ile Ile Arg Glu Asp Phe Phe Glu Lys Ile Asp

CGT GTA ATT TTC TGG GGA GAG AAA TAT GGA ATA CAT ATA TGT ATT TCT CTT 
Arg Val Ile Phe Trp Gly Glu Lys Tyr Gly Ile His Ile Cys Ile Ser Leu

CAC AGG GCA CCT GGC TAT TCT GTT AAC AAG GAA GTA GAA GAG AAA ACC AAT 
His Arg Ala Pro Gly Tyr Ser Val Asn Lys Glu Val Glu Glu Lys Thr Asn 

CTG TGG AAA GAT GAA ACA GCT CAA GAA GCG TTC ATT CAT CAC TGG TCT TTT 
Leu Trp Lys Asp Glu Thr Ala Gln Glu Ala Phe Ile His His Trp Ser Phe
ATC GCA CGT CGT TAC AAA GGA ATT
Ile Ala Arg Arg Tyr Lys Gly Ile

ATA AAT GAG CCT CCA TTT CCT GAT 
Ile Asn Glu Pro Pro Phe Pro Asp

AAC TCT CTT ATC AAG AGA ACT ATT 
Asn Ser Leu Ile Lys Arg Thr Ile

AGA TTA ATT ATA ATA GAT GGA TTA 
Arg Leu Ile Ile Ile Asp Gly Leu

TTA ACA ATT GAG AAT ACA GTG CAA
Leu Thr Ile Glu Asn Thr Val Gln

GTT ACT CAT TAC AAA GCG GAA TGG 
Val Thr His Tyr Lys Ala Glu Trp 

GAG TGG CCA AAT GGA TGG CAT TTT 
Glu Trp Pro Asn Gly Trp His Phe 

TTG GAA CAT TAT TTA ACG TGG ATA 
Leu Glu His Tyr Leu Thr Trp Ile
TCT TCC ACA CAC CTG AGT TTT AAC TTA
Ser Ser Thr His Leu Ser Phe Asn Leu 

CCA CAA ATC ATG AGT GTT GAA GAT CAC 
Pro Gln Ile Met Ser Val Glu Asp His

ACA GAA ATT CGA AAA ATA GAT CCC GAA 
Thr Glu Ile Arg Lys Ile Asp Pro Glu

GGC TAT GGG AAT ATT CCA GTG GAT GAT 
Gly Tyr Gly Asn Ile Pro Val Asp Asp

TCA TGC AGA GGG TAC ATT CCC TTC AGT 
Ser Cys Arg Gly Tyr Ile Pro Phe Ser

GTG GAT AGT AAG GAC TTT CCT GTT CCT 
Val Asp Ser Lys Asp Phe Pro Val Pro 

GGG GAA TAC TGG AAC AGA GAA AAG TTA 
Gly Glu Tyr Trp Asn Arg Glu Lys Leu 

AAA CTC AGA CAA AAA GGA ATA GAA GTA 
Lys Leu Arg Gln Lys Gly Ile Glu Val
TTC TGT GGA GAA ATG GGA GCT TAC
Phe Cys Gly Glu Met Gly Ala Tyr 

AAA TGG CTT GAA GAT CTT TTA GAA 
Lys Trp Leu Glu Asp Leu Leu Glu 

GCC TTA TGG AAT TTT AGA GGT CCT 
Ala Leu Trp Asn Phe Arg Gly Pro 

GAC GTT GAA TAC GAA GAA TGG TAT 
Asp Val Glu Tyr Glu Glu Trp Tyr 

GAA CTA TTG AGA AAA TAT TAG 
Glu Leu Leu Arg Lys Tyr End 
AAC AAA ACA CCT CAC GAT GTG GTT TTA
Asn Lys Thr Pro His Asp Val Val Leu 

ATT TTT AAA ACT TTG AAC ATA GGG TTT 
Ile Phe Lys Thr Leu Asn Ile Gly Phe

TTT GGT ATT TTA GAT TCG GAA AGG AAA 
Phe Gly Ile Leu Asp Ser Glu Arg Lys

GGA CAT AAA CTG GAT AGG AAA ATG TTG 
Gly His Lys Leu Asp Arg Lys Met Leu 

990 
Thermotoga maritima MSB8 (Clone # 6GC17) Glycosidase

1 

ATG CTC TCA GAG ATT GTT CCG TAT ACT GTT CTG AGA AGA GAA AGA ATA GAA 
Met Leu Ser Glu Ile Val Pro Tyr Thr Val Leu Arg Arg Glu Arg Ile Glu

AGC TGG ATT TTC TCC GAT GAT GCT GTT GAG AGA ATC GTG GAT CCT TCC TTC 
Ser Trp Ile Phe Ser Asp Asp Ala Val Glu Arg Ile Val Asp Pro Ser Phe

GAA TGG GAC TTC AGC TCC GCT CCC GTC CGG TTC AGG AAA GAG CTA GAG CCT 
Glu Trp Asp Phe Ser Ser Ala Pro Val Arg Phe Arg Lys Glu Leu Glu Pro 

TTC TCC GTC GCT GGA GAG CAG AGG GCC TAC CTG AAA CTC TGG TTC GGT GGT 
Phe Ser Val Ala Gly Glu Gln Arg Ala Tyr Leu Lys Leu Trp Phe Gly Gly

GAA ACA CTC GTT CTG ATA GAT GGG AAG CCT TAC GGT GAG ATC AAC GAG TAT 
Glu Thr Leu Val Leu Ile Asp Gly Lys Pro Tyr Gly Glu Ile Asn Glu Tyr

CAT AGG ATG TTG AAC ATC ACC CCC CTT GCT GAT GGA AAA CCA CAC ACG ATA 
His Arg Met Leu Asn Ile Thr Pro Leu Ala Asp Gly Lys Pro His Thr Ile



GAA GCT CAG GTG ATG CCA AGG GGT CTC TTT GGA AAA CCA GAA AAG CCG GTG 
Glu Ala Gln Val Met Pro Arg Gly Leu Phe Gly Lys Pro Glu Lys Pro Val
TTC ACG GAA GCT TTC TTC ATC GTC
Phe Thr Glu Ala Phe Phe Ile Val

AAA ACT CTC GAA CTC ACT ATA AAA 
Lys Thr Leu Glu Leu Thr Ile Lys

CTT TCT AAG AAA CTT CTG GAC ATC 
Leu Ser Lys Lys Leu Leu Asp Ile

ATC CCA AGA GAC ACA GGT ACC TAT 
Ile Pro Arg Asp Thr Gly Thr Tyr

ATA AAA GAT GAG ATC AAA AAC ACC 
Ile Lys Asp Glu Ile Lys Asn Thr
GTT GAT GAA GCA CTG ATG AAG GTG GTG
Val Asp Glu Ala Leu Met Lys Val Val 

ACG GCA GAA GTG ATA GAA GAC GAG TCG 
Thr Ala Glu Val Ile Glu Asp Glu Ser

TCC GAG GAG TTT CTC TCG AAA GTA TGG 
Ser Glu Glu Phe Leu Ser Lys Val Trp 

CTG ATG ACA GCA CTG GAG GAT CCG GGA 
Leu Met Thr Ala Leu Glu Asp Pro Gly 

TGG AAC ACA CCG GAG TTC AAA GAG TTC 
Trp Asn Thr Pro Glu Phe Lys Glu Phe 



ACA GGT GTG AAG CTT CCT GAA GAG 
Thr Gly Val Lys Leu Pro Glu Glu 

GAA AAA TTC AAA GAA AAG CTG GAT 
Glu Lys Phe Lys Glu Lys Leu Asp 

GGA ACG ATT CAC CTT GTG GGG CAC 
Gly Thr Ile His Leu Val Gly His



TTG AGA AAT CAG ATT CTG GAA GAG TTC 
Leu Arg Asn Gln Ile Leu Glu Glu Phe

AGA ATA AGA AAA AAC CAT CCG GGT TTT 
Arg Ile Arg Lys Asn His Pro Gly Phe

GCG CAC ATA GAC TAC GCC TGG CTC TGG 
Ala His Ile Asp Tyr Ala Trp Leu Trp
CCA GTT GAG GAG ACG AAG AGA AAG
Pro Val Glu Glu Thr Lys Arg Lys 

TTG CTC TCT AAG CTT TAT CCG GAG 
Leu Leu Ser Lys Leu Tyr Pro Glu 

ATG TAC GAG GAT CTC AAG CAA AAT 
Met Tyr Glu Asp Leu Lys Gln Asn

AAG CTC GTA GAA GAG GGG AGA TGG 

Lys Leu Val Glu Glu Gly Arg Trp 

TCG GAC TGC AAC GTT CCA TCG ATA 
Ser Asp Cys Asn Val Pro Ser Ile

GGG CAA AAA TTC TTC GAA AGA GAA 
Gly Gln Lys Phe Phe Glu Arg Glu

CTT CCG GAT GTG TTT GGG TTT TCC 
Leu Pro Asp Val Phe Gly Phe Ser 

GCC GGG ATA AAA TAC TTC GTC ACC
ATC CTA CGC ACT TTC GCA AAC TCT GTG
Ile Leu Arg Thr Phe Ala Asn Ser Val

TTC GTT TAC ACT CAG TCT TCT GCT CAG 
Phe Val Tyr Thr Gln Ser Ser Ala Gln

TCA CCA GAG CTT TTC GAG GAA GTG AGA 
Ser Pro Glu Leu Phe Glu Glu Val Arg 

GAG CCA GTC GGT GGC ATG TGG GTG GAG 
Glu Pro Val Gly Gly Met Trp Val Glu 

GAG TCG CTT GTG AGA CAG TTC TAC TAT 
Glu Ser Leu Val Arg Gln Phe Tyr Tyr

TTC GGG AAA AAG AGC AAG GTG TGC TGG 
Phe Gly Lys Lys Ser Lys Val Cys Trp 

TGG GTG CTT CCC CAA ATT CTG AAA GAA 
Trp Val Leu Pro Gln Ile Leu Lys Glu

ACG AAA CTC AAC TGG AAC GAC ACG AAC
Ala Gly Ile Lys Tyr Phe Val Thr Thr Lys Leu Asn Trp Asn Asp Thr Asn

GAG TTT CCG TAC GAT CTG TGC CGC TGG AGG GGA ATA GAT GGA TCC GAA GTG 
Glu Phe Pro Tyr Asp Leu Cys Arg Trp Arg Gly Ile Asp Gly Ser Glu Val

ATC TAT TTC AGT TTC AAA AAT CCC AAC GAG GGG TAC AAC GGA AAG ATA GAT 
Ile Tyr Phe Ser Phe Lys Asn Pro Asn Glu Gly Tyr Asn Gly Lys Ile Asp

CCC GAT ACG GTC TAC AAA ACC TGG AAG AAC TTC AGG CAG AAA GAT CTC ACA 
Pro Asp Thr Val Tyr Lys Thr Trp Lys Asn Phe Arg Gln Lys Asp Leu Thr

AAC AGA GTT CTT CTT TCG TTC GGA CAC GGT GAT GGT GGT GGC GGT CCA ACC 
Asn Arg Val Leu Leu Ser Phe Gly His Gly Asp Gly Gly Gly Gly Pro Thr 

GAA GAG ATG CTG GAA AAT TAC GAG GTT CTG AAG GAT TTC CCT GGA CTA CCG 
Glu Glu Met Leu Glu Asn Tyr Glu Val Leu Lys Asp Phe Pro Gly Leu Pro 

CAC CTT GAA ATG GGA ACT GTG GAA GAA TTT TTC AAG AAG GTG GAG ATC GAC 
His Leu Glu Met Gly Thr Val Glu Glu Phe Phe Lys Lys Val Glu Ile Asp

GAA GAA CTC CCT GTG TGG GAC GGA GAG CTT TAC CTT GAA CTT CAC AGG GGA 
Glu Glu Leu Pro Val Trp Asp Gly Glu Leu Tyr Leu Glu Leu His Arg Gly
ACC TAC ACT TCT CAG TTC AGG ACA AAG AAA CTT CAC AAA GAA GCG GAA GAC

Thr Tyr Thr Ser Gln Phe Arg Thr Lys Lys Leu His Lys Glu Ala Glu Asp

AGT CTT TAT CTT GCA GAG TTG ATC TCG GCT TTC ACG GAT AAA GAT TTT TCG 
Ser Leu Tyr Leu Ala Glu Leu Ile Ser Ala Phe Thr Asp Lys Asp Phe Ser

GAC GAA ATA GAC GAA CTC TGG AAG ATT CTG TTG AGA AAC GAA TTT CAC GAT 
Asp Glu Ile Asp Glu Leu Trp Lys Ile Leu Leu Arg Asn Glu Phe His Asp

ATT CTA CCT GGA TCT TCT ATA AAG GAA GTC TAT GAA GAT ACA GAA AAA GAG 
Ile Leu Pro Gly Ser Ser Ile Lys Glu Val Tyr Glu Asp Thr Glu Lys Glu

CTC AGA CAT GTG ATA GAA AAA TCA AAA GAC ATC GTT ATC GAA TCT CTC AAA 
Leu Arg His Val Ile Glu Lys Ser Lys Asp Ile Val Ile Glu Ser Leu Lys

GTT CTT TCC TCT GAG AAC AAA GAT GTT CTA ACC ATT TTG AAC GCT TCA TCG 
Val Leu Ser Ser Glu Asn Lys Asp Val Leu Thr Ile Leu Asn Ala Ser Ser

TTT CCA AAG AAG TGT CTT TTC TTC CTC AAC GAA GAT CTC GCG ATT TCC TTT 
Phe Pro Lys Lys Cys Leu Phe Phe Leu Asn Glu Asp Leu Ala Ile Ser Phe

GAA GGA GAA GCA CTC TTG AAA CAG AAA ACT CAC GAT GGA AGG TAT GTG TAC 
Glu Gly Glu Ala Leu Leu Lys Gln Lys Thr His Asp Gly Arg Tyr Val Tyr
TTC ATA GAC AGG GAG ATT CCT CCG
Phe Ile Asp Arg Glu Ile Pro Pro

AAA GCC ACG TCT GAG GAA ACT CCA 
Lys Ala Thr Ser Glu Glu Thr Pro 

GAG AAC GAA TTT CTC AGG GTG CAC 
Glu Asn Glu Phe Leu Arg Val His 

TAC GAC AAA GAA CTG GAC AGG TAC 
Tyr Asp Lys Glu Leu Asp Arg Tyr 

AAA CTT CAT AAA AAC ATC CCT GCT 
Lys Leu His Lys Asn Ile Pro Ala

AAC GTG GAA AAG ACA GGA TAT ACC 
Asn Val Glu Lys Thr Gly Tyr Thr 

GAG TCT GGC CCT GTT CGA GAA GTG 
Glu Ser Gly Pro Val Arg Glu Val 

AGC AGG ATC ACG CAG CAT TAC ATC
TTC ACG AAA GTA GAA CTG AAA GTT CGC
Phe Thr Lys Val Glu Leu Lys Val Arg 

AGT GAG TTG AGA GAA ACA AAC ATC ATG 
Ser Glu Leu Arg Glu Thr Asn Ile Met

GTC AAC GAT GAC GGA ACA ATT CAR ATC 
Val Asn Asp Asp Gly Thr Ile Gln Ile

GTT TTC GAA GAG AAG GGA AAC ATC TTG 
Val Phe Glu Glu Lys Gly Asn Ile Leu

TAC TGG GAC AAC TGG GAT ATC GCA GAA 
Tyr Trp Asp Asn Trp Asp Ile Ala Glu

CTG AGG GCG AAA AAC ATA GAA AAA ATA 
Leu Arg Ala Lys Asn Ile Glu Lys Ile

ATC CGT GTT GAA CAT GAA TCA GAA GGA 
Ile Arg Val Glu His Glu Ser Glu Gly

CTT TAC AGA AAG AGT AGA AGG CTC GAT
Ser Arg Ile Thr Gln His Tyr Ile Leu Tyr Arg Lys Ser Arg Arg Leu Asp



ATA GAA ACG AAG GTA GAC TGG CAC 
Ile Glu Thr Lys Val Asp Trp His

TTC CCA ACA ACT GTT CTG TCG AGA 
Phe Pro Thr Thr Val Leu Ser Arg 

TTC ATC GAA AGG CCC ACA CAC AGA 
Phe Ile Glu Arg Pro Thr His Arg

GAG GTG CCG TTT CAC AGG TGG ATG 
Glu Val Pro Phe His Arg Trp Met 

TCC ATT CTG AAC GAC GGA AAA TAC 
Ser Ile Leu Asn Asp Gly Lys Tyr

GCG CTT TCA CTG ATA AAA GCG GGT 
Ala Leu Ser Leu Ile Lys Ala Gly

GGC GAA CAC ACT TTC ACC TAT TCT 
Gly Glu His Thr Phe Thr Tyr Ser 



ACA AGG CGT GCG CTT CTC AGA GCC TAC 
Thr Arg Arg Ala Leu Leu Arg Ala Tyr 

AAG GCT AGG TTC GAT ATC TCC GGT GGT 
Lys Ala Arg Phe Asp Ile Ser Gly Gly

AAC ACC AGT TTC GAA CAG GCG CGT TTC 
Asn Thr Ser Phe Glu Gln Ala Arg Phe

GAT CTT TCC CAG ACA GAC TTC GGC GTG 
Asp Leu Ser Gln Thr Asp Phe Gly Val

GGT GGC AGT GTT CAT CAG GGT ATC ATG 
Gly Gly Ser Val His Gln Gly Ile Met

ATT TTC CCC GAT TTT CTC TGT GAC GAA 
Ile Phe Pro Asp Phe Leu Cys Asp Glu

GTC TAC GTA CAC CCT GGA GAC AGC TTG 
Val Tyr Val His Pro Gly Asp Ser Leu
AGA GAT GTT GTA AAA
Arg Asp Val Val Lys 

CGC GGG GTG TTG AAC 
Arg Gly Val Leu Asn 

TTC CGT CTC ACC TCA 
Phe Arg Leu Thr Ser 

GTT GAG ATT TTC GGA 
Val Glu Ile Phe Gly

GGT GAA ATC TAT CAG 
Gly Glu Ile Tyr Gln

TTC CCA GTG GTT TAC 
Phe Pro Val Val Tyr
GGA TCA GAA GAT CTC AAC
Gly Ser Glu Asp Leu Asn 

CTC CCC TCT CCT TTA CTG 
Leu Pro Ser Pro Leu Leu 

CTG AGA AGG GTG AAG GAC 
Leu Arg Arg Val Lys Asp 

ACA TCA GGG AAA CTT TCC 
Thr Ser Gly Lys Leu Ser 

ACG AAC GTT CTG GAA GAG 
Thr Asn Val Leu Glu Glu 

CAT CCG TTC AAG ATC TAC 
His Pro Phe Lys Ile Tyr
AGA TCT TTC ATC GTT CAT
Arg Ser Phe Ile Val His

GAG ATC TCT CCT CAA AAC 
Glu Ile Ser Pro Gln Asn

AAA ATT GTT TTG AGG CTT 
Lys Ile Val Leu Arg Leu

ATT AAA CTC CCA TGG CAT 
Ile Lys Leu Pro Trp His

AAA AAA CAG AAA GTC ACC 
Lys Lys Gln Lys Val Thr

ACT TTT GTT GTA GAA GGT 
Thr Phe Val Val Glu Gly 



TGA 
END
Thermotoga maritima MSB8 (Clone # 6GC18) Glycosidase

1 

ATG GAA CTG TAC AGG GAT CCT TCG CAA CCC ATC GAA GTG AGA GTG AGA GAT 
Met Glu Leu Tyr Arg Asp Pro Ser Gln Pro Ile Glu Val Arg Val Arg Asp

CTT CTT TCC AGA ATG ACG CTG GAA GAG AAA GTG GCC CAG CTT GGG TCT GTC 
Leu Leu Ser Arg Met Thr Leu Glu Glu Lys Val Ala Gln Leu Gly Ser Val

TGG GGT TAC GAA CTG ATA GAC GAG AGG GGA AAG TTC AGT AGA GAA AAA GCA 
Trp Gly Tyr Glu Leu Ile Asp Glu Arg Gly Lys Phe Ser Arg Glu Lys Ala

AAA GAA CTC CTC AAA AAT GGT ATA GGC CAG ATC ACA AGG CCT GGT GGA TCA 
Lys Glu Leu Leu Lys Asn Gly Ile Gly Gln Ile Thr Arg Pro Gly Gly Ser

ACG AAC CTT GAA CCT CAA GAA GCC GCG GAA CTT GTG AAC GAA ATA CAG AGA 
Thr Asn Leu Glu Pro Gln Glu Ala Ala Glu Leu Val Asn Glu Ile Gln Arg

TTT CTT GTG GAA GAA ACA CGC CTT GGA ATT CCT GCG ATG ATA CAC GAA GAA 
Phe Leu Val Glu Glu Thr Arg Leu Gly Ile Pro Ala Met Ile His Glu Glu

TGT CTC ACC GGT TAC ATG GGA CTT GGA GGA ACC AAC TTC CCT CAG GCG ATA 
Cys Leu Thr Gly Tyr Met Gly Leu Gly Gly Thr Asn Phe Pro Gln Ala Ile
GCA ATG GCG AGT ACA TGG GAT CCA GAT CTC ATA GAA AAA ATG ACC ACC GCC

Ala Met Ala Ser Thr Trp Asp Pro Asp Leu Ile Glu Lys Met Thr Thr Ala

GTC AGA GAG GAT ATG AGA AAG ATA GGG GCA CAT CAG GGT CTC GCA CCT GTT 
Val Arg Glu Asp Met Arg Lys Ile Gly Ala His Gln Gly Leu Ala Pro Val

CTG GAT GTC GCA AGA GAT CCA AGG TGG GGG AGA ACA GAA GAG ACG TTC GGA 
Leu Asp Val Ala Arg Asp Pro Arg Trp Gly Arg Thr Glu Glu Thr Phe Gly 

GAA TCT CCC TAT CTG GTG GCG AGG ATG GGA GTC TCT TAC GTG AAA GGC CTC 
Glu Ser Pro Tyr Leu Val Ala Arg Met Gly Val Ser Tyr Val Lys Gly Leu

CAG GGG GAA GAT ATC AAA AAA GGT GTC GTT GCC ACA GTG AAA CAC TTC GCC 
Gln Gly Glu Asp Ile Lys Lys Gly Val Val Ala Thr Val Lys His Phe Ala

GGA TAC AGC GCT TCT GAA GGT GGA AAG AAC TGG GCA CCA ACG AAC ATT CCG 
Gly Tyr Ser Ala Ser Glu Gly Gly Lys Asn Trp Ala Pro Thr Asn Ile Pro

GAG AGG GAA TTC AAA GAG GTC TTT CTC TTT CCG TTC GAA GCG GCC GTT AAA 
Glu Arg Glu Phe Lys Glu Val Phe Leu Phe Pro Phe Glu Ala Ala Val Lys 

GAA GCG AAT GTG CTT TCT GTG ATG AAC TCC TAC AGC GAA ATA GAC GGT GTC 
Glu Ala Asn Val Leu Ser Val Met Asn Ser Tyr Ser Glu Ile Asp Gly Val
CCA TGT GCA GCG AAC AGG AAA CTC
Pro Cys Ala Ala Asn Arg Lys Leu 

GGA TTC GAA GGA ATC GTC GTT TCT 
Gly Phe Glu Gly Ile Val Val Ser

GAT TAT CAC AGA ATA GCA AGG GAT 
Asp Tyr His Arg Ile Ala Arg Asp

GAA GCG GGG ATA GAT GTT GAA CTT 
Glu Ala Gly Ile Asp Val Glu Leu

AAA GAC CTT GTT GAA AAA GGC ATC 
Lys Asp Leu Val Glu Lys Gly Ile

GTC ACC AGG GTG CTG AGG CTG AAG 
Val Thr Arg Val Leu Arg Leu Lys 

TAC GTT GAG GTG GAA AAA GCA AAG 
Tyr Val Glu Val Glu Lys Ala Lys 



CTC ACA GAC ATT CTC AGA AAA GAC TGG 
Leu Thr Asp Ile Leu Arg Lys Asp Trp

GAC TAT TTT GCT GTG AAA GTT CTG GAA 
Asp Tyr Phe Ala Val Lys Val Leu Glu 

AAG TCA GAA GCC GCA AGA CTC GCA CTT 
Lys Ser Glu Ala Ala Arg Leu Ala Leu 

CCG AAG ACA GAA TGT TAT CAA TAT TTG 
Pro Lys Thr Glu Cys Tyr Gln Tyr Leu

ATC TCC GAA GCT TTG ATC GAC GAG GCA 
Ile Ser Glu Ala Leu Ile Asp Glu Ala

TTC ATG CTC GGG CTC TTC GAA AAT CCC 
Phe Met Leu Gly Leu Phe Glu Asn Pro 

ATA GAA AGT CAC AGA GAC ATC GCA CTC 
Ile Glu Ser His Arg Asp Ile Ala Leu



GAG ATA GCA AGG AAA TCC ATT ATC CTT CTC AAG AAT GAT GGA ATT CTG CCT
Glu Ile Ala Arg Lys Ser Ile Ile Leu Leu Lys Asn Asp Gly Ile Leu Pro

CTT CAG AAA AAC AAA AAA GTT GCC CTG ATC GGA CCG AAC GCG GGT GAG GTG 
Leu Gln Lys Asn Lys Lys Val Ala Leu Ile Gly Pro Asn Ala Gly Glu Val

AGA AAT CTC CTC GGA GAT TAC ATG TAC CTT GCA CAC ATA AGG GCT CTC CTC 
Arg Asn Leu Leu Gly Asp Tyr Met Tyr Leu Ala His Ile Arg Ala Leu Leu

GAC AAC ATA GAC GAC GTC TTT GGA AAT CCT CAG ATC CCG AGA GAA AAC TAC 
Asp Asn Ile Asp Asp Val Phe Gly Asn Pro Gln Ile Pro Arg Glu Asn Tyr

GAA AGA CTG AAG AAG AGC ATA GAA GAA CAT ATG AAG AGC ATT CCG AGT GTT 
Glu Arg Leu Lys Lys Ser Ile Glu Glu His Met Lys Ser Ile Pro Ser Val

CTC GAT GCC TTC AAA GAA GAA GGG ATC GAA TTC GAA TAT GCA AAA GGC TGT 
Leu Asp Ala Phe Lys Glu Glu Gly Ile Glu Phe Glu Tyr Ala Lys Gly Cys

GAA GTG ACA GGG GAA GAC AGA AGC GGT TTC GAA GAG GCG ATA GAA ATT GCA 
Glu Val Thr Gly Glu Asp Arg Ser Gly Phe Glu Glu Ala Ile Glu Ile Ala

AAG AAA TCC GAC GTT GCC ATC GTT GTC GTA GGG GAC AAA TCT GGA CTC ACC 
Lys Lys Ser Asp Val Ala Ile Val Val Val Gly Asp Lys Ser Gly Leu Thr
CTT GAC TGC ACA ACC GGT GAG TCC AGA GAC ATG GCA AAC CTC AAG CTT CCA

Leu Asp Cys Thr Thr Gly Glu Ser Arg Asp Met Ala Asn Leu Lys Leu Pro

GGA GTC CAG GAA GAA CTC GTC CTC GAA GTT GCA AAG ACA GGA AAA CCC GTC 
Gly Val Gln Glu Glu Leu Val Leu Glu Val Ala Lys Thr Gly Lys Pro Val

GTT CTT GTC CTC ATC ACG GGA AGA CCC TAT TCA CTC AAA AAC GTC GTC GAC 
Val Leu Val Leu Ile Thr Gly Arg Pro Tyr Ser Leu Lys Asn Val Val Asp

AAG GTG AAC GCG ATC CTT CAG GTG TGG CTT CCT GGA GAA GCG GGA GGA AGA 
Lys Val Asn Ala Ile Leu Gln Val Trp Leu Pro Gly Glu Ala Gly Gly Arg

GCG ATC GTT GAC ATC ATC TAT GGA AAG GTG AAT CCC TCT GGA AAA CTC CCG 
Ala Ile Val Asp Ile Ile Tyr Gly Lys Val Asn Pro Ser Gly Lys Leu Pro

ATC AGC TTT CCA AGA AGC GCT GGT CAG ATT CCT GTC TTC CAC TAC GTC AAA 
Ile Ser Phe Pro Arg Ser Ala Gly Gln Ile Pro Val Phe His Tyr Val Lys

CCA TCC GGG GGA AGG TCT CAC TGG CAC GGA GAC TAC GTG GAT GAG AGC ACA 
Pro Ser Gly Gly Arg Ser His Trp His Gly Asp Tyr Val Asp Glu Ser Thr 

AAG CCT CTC TTC CCG TTT GGG CAC GGT TTG TCT TAC ACG AAG TTC GAG TAC 
Lys Pro Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Thr Lys Phe Glu Tyr
AGC AAC CTC AGA ATC
Ser Asn Leu Arg Ile

ATA AAG GTG GAC GTG 
Ile Lys Val Asp Val

CAA CTT TAC ATC GGT 
Gln Leu Tyr Ile Gly

CTG AAG GGC TTC AAG 
Leu Lys Gly Phe Lys 

GTG TTC AGG CTT CAC 
Val Phe Arg Leu His 

CTC GTG GTT GAA CCC 
Leu Val Val Glu Pro 

GAC ATC AGA CTC ACA 
Asp Ile Arg Leu Thr

GTG GGA ATG AGG AAA 
Val Gly Met Arg Lys
GAG CCG AAG GAA GTG CCA
Glu Pro Lys Glu Val Pro 

GAA AAC ATC GGA GAC AGA 
Glu Asn Ile Gly Asp Arg

CGT GAG TTT GCA AGC GTC 
Arg Glu Phe Ala Ser Val 

AGG GTT TCT TTG AAG GCG 
Arg Val Ser Leu Lys Ala 

ATG GAC GTG CTC GCC TAC 
Met Asp Val Leu Ala Tyr 

GGT GAG TTC AAA GTG ATG 
Gly Glu Phe Lys Val Met 

GGT TCT TTC TCC GTC GTC 
Gly Ser Phe Ser Val Val 

TTC TTC ACG GAA GCC TGC 
Phe Phe Thr Glu Ala Cys
CCG GCC GGC GAA GTG GTG
Pro Ala Gly Glu Val Val 

GAC GGA GAC GAG GTG GTT 
Asp Gly Asp Glu Val Val 

ACA AGG CCT GTG AAA GAG 
Thr Arg Pro Val Lys Glu 

AAA GAG AAG AAG ACT GTT 
Lys Glu Lys Lys Thr Val 

TAC AAC AGA GAC ATG AAA 
Tyr Asn Arg Asp Met Lys 

GTG GGA AGC TCT TCT GAA 
Val Gly Ser Ser Ser Glu 

GGT GAA AAA AGA GAA GTG 
Gly Glu Lys Arg Glu Val 

GAG GAG TGA 2336 
Glu Glu END
Thermotoga maritima MSB8 (Clone # 6GP2) Glycosidase

1 

ATG GGG ATT GGT GGC GAC GAC TCC TGG AGC CCG TCA GTA TCG GCG GAA TTC 
Met Gly Ile Gly Gly Asp Asp Ser Trp Ser Pro Ser Val Ser Ala Glu Phe

CTT TTA TTG ATC GTT GAG CTC TCT TTC GTT CTC TTT GCA AGT GAC GAG TTC 
Leu Leu Leu Ile Val Glu Leu Ser Phe Val Leu Phe Ala Ser Asp Glu Phe

GTG AAA GTG GAA AAC GGA AAA TTC GCT CTG AAC GGA AAA GAA TTC AGA TTC 
Val Lys Val Glu Asn Gly Lys Phe Ala Leu Asn Gly Lys Glu Phe Arg Phe 

ATT GGA AGC AAC AAC TAC TAC ATG CAC TAC AAG AGC AAC GGA ATG ATA GAC 
Ile Gly Ser Asn Asn Tyr Tyr Met His Tyr Lys Ser Asn Gly Met Ile Asp

AGT GTT CTG GAG AGT GCC AGA GAC ATG GGT ATA AAG GTC CTC AGA ATC TGG 
Ser Val Leu Glu Ser Ala Arg Asp Met Gly Ile Lys Val Leu Arg Ile Trp

GGT TTC CTC GAC GGG GAG AGT TAC TGC AGA GAC AAG AAC ACC TAC ATG CAT 
Gly Phe Leu Asp Gly Glu Ser Tyr Cys Arg Asp Lys Asn Thr Tyr Met His 

CCT GAG CCC GGT GTT TTC GGG GTG CCA GAA GGA ATA TCG AAC GCC CAG AGC 
Pro Glu Pro Gly Val Phe Gly Val Pro Glu Gly Ile Ser Asn Ala Gln Ser
GGT TTC GAA AGA CTC GAC TAC ACA
Gly Phe Glu Arg Leu Asp Tyr Thr 

AAA CTT GTC ATT GTT CTT GTG AAC 
Lys Leu Val Ile Val Leu Val Asn

CAG TAC GTG AGG TGG TTT GGA GGA 
Gln Tyr Val Arg Trp Phe Gly Gly

GAG AAG ATC AAA GAA GAG TAC AAA 
Glu Lys Ile Lys Glu Glu Tyr Lys

GTC AAT ACC TAC ACG GGA GTT CCT 
Val Asn Thr Tyr Thr Gly Val Pro 

TGG GAG CTT GCA AAC GAA CCG CGC 
Trp Glu Leu Ala Asn Glu Pro Arg 

CTC GTT GAG TGG GTG AAG GAG ATG 
Leu Val Glu Trp Val Lys Glu Met 

AAC CAC CTC GTG GCT GTG GGG GAC 
Asn His Leu Val Ala Val Gly Asp
GTT GCG AAA GCG AAA GAA CTC GGT ATA
Val Ala Lys Ala Lys Glu Leu Gly Ile

AAC TGG GAC GAC TTC GGT GGA ATG AAC 
Asn Trp Asp Asp Phe Gly Gly Met Asn 

ACC CAT CAC GAC GAT TTC TAC AGA GAT 
Thr His His Asp Asp Phe Tyr Arg Asp 

AAG TAC GTC TCC TTT CTC GTA AAC CAT 
Lys Tyr Val Ser Phe Leu Val Asn His 

TAC AGG GAA GAG CCC ACC ATC ATG GCC 
Tyr Arg Glu Glu Pro Thr Ile Met Ala

TGT GAG ACG GAC AAA TCG GGG AAC ACG 
Cys Glu Thr Asp Lys Ser Gly Asn Thr 

AGC TCC TAC ATA AAG AGT CTG GAT CCC 
Ser Ser Tyr Ile Lys Ser Leu Asp Pro

GAA GGA TTC TTC AGC AAC TAC GAA GGA 
Glu Gly Phe Phe Ser Asn Tyr Glu Gly
TTC AAA CCT TAC GGT GGA GAA GCC GAG TGG GCC TAC AAC GGC TGG TCC GGT
Phe Lys Pro Tyr Gly Gly Glu Ala Glu Trp Ala Tyr Asn Gly Trp Ser Gly 

GTT GAC TGG AAG AAG CTC CTT TCG ATA GAG ACG GTG GAC TTC GGC ACG TTC 
Val Asp Trp Lys Lys Leu Leu Ser Ile Glu Thr Val Asp Phe Gly Thr Phe

CAC CTC TAT CCG TCC CAC TGG GGT GTC AGT CCA GAG AAC TAT GCC CAG TGG 
His Leu Tyr Pro Ser His Trp Gly Val Ser Pro Glu Asn Tyr Ala Gln Trp

GGA GCG AAG TGG ATA GAA GAC CAC ATA AAG ATC GCA AAA GAG ATC GGA AAA 
Gly Ala Lys Trp Ile Glu Asp His Ile Lys Ile Ala Lys Glu Ile Gly Lys

CCC GTT GTT CTG GAA GAA TAT GGA ATT CCA AAG AGT GCG CCA GTT AAC AGA 
Pro Val Val Leu Glu Glu Tyr Gly Ile Pro Lys Ser Ala Pro Val Asn Arg

ACG GCC ATC TAC AGA CTC TGG AAC GAT CTG GTC TAC GAT CTC GGT GGA GAT 
Thr Ala Ile Tyr Arg Leu Trp Asn Asp Leu Val Tyr Asp Leu Gly Gly Asp

GGA GCG ATG TTC TGG ATG CTC GCG GGA ATC GGG GAA GGT TCG GAC AGA GAC 
Gly Ala Met Phe Trp Met Leu Ala Gly Ile Gly Glu Gly Ser Asp Arg Asp



GAG AGA GGG TAC TAT CCG GAC TAC GAC GGT TTC AGA ATA GTG AAC GAC GAC
Glu Arg Gly Tyr Tyr

AGT CCA GAA GCG GAA 
Ser Pro Glu Ala Glu 

GAA GAC ATA AGA GAA 
Glu Asp Ile Arg Glu

GAG ATC AAA AAG ACC 
Glu Ile Lys Lys Thr

ACG TTT GAA AAG TTG 
Thr Phe Glu Lys Leu 

ATA GAG CAT CTC GGA 
Ile Glu His Leu Gly

ATC CCG GAT GGA GAA 
Ile Pro Asp Gly Glu

ACG GTG AAA GAC TCT 
Thr Val Lys Asp Ser
Pro Asp Tyr Asp Gly Phe

CTG ATA AGA GAA TAC GCG 
Leu Ile Arg Glu Tyr Ala

GAC ACC TGC TCT TTC ATC 
Asp Thr Cys Ser Phe Ile

GTG GAA GTG AGG GCT GGT 
Val Glu Val Arg Ala Gly 

TCT GTC AAA GTC GAA GAT 
Ser Val Lys Val Glu Asp 

TAC GGA ATT TAC GGC TTT 
Tyr Gly Ile Tyr Gly Phe

CAT GAA ATG TTC CTT GAA 
His Glu Met Phe Leu Glu 

ATC AAA GCG AAA GTG GTG 
Ile Lys Ala Lys Val Val
Arg Ile Val Asn Asp Asp

AAG CTG TTC AAC ACA GGT 
Lys Leu Phe Asn Thr Gly 

CTT CCA AAA GAC GGC ATG 
Leu Pro Lys Asp Gly Met 

GTT TTC GAC TAC AGC AAC 
Val Phe Asp Tyr Ser Asn 

CTG GTT TTT GAA AAT GAG 
Leu Val Phe Glu Asn Glu 

GAT CTC GAC ACA ACC CGG 
Asp Leu Asp Thr Thr Arg 

GGC CAC TTT CAG GGA AAA 
Gly His Phe Gln Gly Lys

AAC GAA GCA CGG TAC GTG 
Asn Glu Ala Arg Tyr Val
CTC GCA GAG GAA GTT
Leu Ala Glu Glu Val 

AAC AGC GGA ACC TGG 
Asn Ser Gly Thr Trp 

GGT GAG GTG GGA AAT 
Gly Glu Val Gly Asn 

AGC GAC TGG GAA GAA 
Ser Asp Trp Glu Glu 

TGT GAG ATC CTC GAG 
Cys Glu Ile Leu Glu

GGA AGG TTG AGG CCG 
Gly Arg Leu Arg Pro 

CTC GAC ATG AAC AAC 
Leu Asp Met Asn Asn 

GGA AAA GAG TAC AGA 
Gly Lys Glu Tyr Arg
GAT TTT TCC TCT CCA GAA

Asp Phe Ser Ser Pro Glu 

CAG GCA GAG TTC GGG TCA 
Gln Ala Glu Phe Gly Ser

GGA GCA CTG CAG CTG AAC 
Gly Ala Leu Gln Leu Asn

GTG AGA GTA GCA AGG AAG 
Val Arg Val Ala Arg Lys 

TAC GAC ATC TAC ATT CCA 
Tyr Asp Ile Tyr Ile Pro

TAC GCG GTT CTG AAC CCC 
Tyr Ala Val Leu Asn Pro 

GCG AAC GTG GAA AGT GCG 
Ala Asn Val Glu Ser Ala 

AGA TTC CAT GTA AGA ATT 
Arg Phe His Val Arg Ile
GAG GTG AAA AAC TGG TGG
Glu Val Lys Asn Trp Trp 

CCT GAC ATT GAA TGG AAC 
Pro Asp Ile Glu Trp Asn

GTG AAA CTG CCC GGA AAG 
Val Lys Leu Pro Gly Lys 

TTC GAA AGA CTC TCA GAA 
Phe Glu Arg Leu Ser Glu 

AAC GTC GAG GGA CTC AAG 
Asn Val Glu Gly Leu Lys 

GGC TGG GTG AAG ATA GGC 
Gly Trp Val Lys Ile Gly

GAG ATC ATC ACT TTC GGC 
Glu Ile Ile Thr Phe Gly

GAG TTC GAC AGA ACA GCG 
Glu Phe Asp Arg Thr Ala
GGG GTG AAA GAA CTT
Gly Val Lys Glu Leu 

GGA CCG ATT TTC ATC 
Gly Pro Ile Phe Ile
CAC ATA GGA GTT GTC GGT
His Ile Gly Val Val Gly

GAT AAT GTG AGA CTT TAT 
Asp Asn Val Arg Leu Tyr
GAT CAT CTG AGG TAC GAT
Asp His Leu Arg Tyr Asp 

AAA AGA ACA GGA GGT ATG 
Lys Arg Thr Gly Gly Met 



TGA 
END 



2042
Polyangium brachysporum (Clone # 78GA1) Glycosidase

1 

ATG TTC CTG CAT CCG AGG GGT CGC ATG ACC CGC CTA GCG CTC GGC TGT GCC 
Met Phe Leu His Pro Arg Gly Arg Met Thr Arg Leu Ala Leu Gly Cys Ala 

GTG CTG TGT CTG GCC GTC GCA GGC TGC GGT GGT GGT GAT GAC GAC GGC GAC 
Val Leu Cys Leu Ala Val Ala Gly Cys Gly Gly Gly Asp Asp Asp Gly Asp 

GAC AAC GGC ACC GCC CCC CAG CCC GCA CCT GGT CAA CCC GAG CCC CCG ACT 
Asp Asn Gly Thr Ala Pro Gln Pro Ala Pro Gly Gln Pro Glu Pro Pro Thr

GAC ACC GTG CTG AAA GAC TGG CCT CGC ATC AAC AGC AGC ATC ACC GCC GAC 
Asp Thr Val Leu Lys Asp Trp Pro Arg Ile Asn Ser Ser Ile Thr Ala Asp

GCA GCG ATC GAA AGC CGC GTC AAC TCA CTC GTC GCG GCG ATG ACG CTG GAA 
Ala Ala Ile Glu Ser Arg Val Asn Ser Leu Val Ala Ala Met Thr Leu Glu

GAA AAA GTC GGC CAG ATG ACG CAG GTC GAA ATC CAG GAG GTG ACG CCG GAG 
Glu Lys Val Gly Gln Met Thr Gln Val Glu Ile Gln Glu Val Thr Pro Glu

GAG ATC CGG CAG TAC CAC ATC GGC TCC GTG CTC AAC GGC GGT GGT TCG TTC 
Glu Ile Arg Gln Tyr His Ile Gly Ser Val Leu Asn Gly Gly Gly Ser Phe
CCG AAG CAG GAC AAG GGC GCG GCG
Pro Lys Gln Asp Lys Gly Ala Ala

GCC TTG TGG GCC GCG TCG ATG GAT 
Ala Leu Trp Ala Ala Ser Met Asp 

ATC TGG GGC ACC GAC GCC GTC CAC 
Ile Trp Gly Thr Asp Ala Val His

ATC TTC CCG CAC AAC ATC GGC CTG 
Ile Phe Pro His Asn Ile Gly Leu

GCC CGC ATC GGC GCC GCC ACG GCG 
Ala Arg Ile Gly Ala Ala Thr Ala

TGG GTG TTC GCG CCA ACG CTG GCG 
Trp Val Phe Ala Pro Thr Leu Ala 

AGC TAC GAA GGC TAT TCG GAA GAC 
Ser Tyr Glu Gly Tyr Ser Glu Asp 

AAG ATG GTC GAA GGC CTG CAG GGC 
Lys Met Val Glu Gly Leu Gln Gly
GTG ACC GAC TGG CTG GCG GTG GCC GAC
Val Thr Asp Trp Leu Ala Val Ala Asp 

CCC GCC AAG CCG CGG CGC ATC CCG CTC 
Pro Ala Lys Pro Arg Arg Ile Pro Leu

GGC CAC AAC AAC GTC AAG GGC GCG ACC 
Gly His Asn Asn Val Lys Gly Ala Thr 

GGC GCC GCG CGC GAC CCC GAC TTG GTC 
Gly Ala Ala Arg Asp Pro Asp Leu Val 

CTG GAA GTG GCA CGC ACC GGC ATC GAC 
Leu Glu Val Ala Arg Thr Gly Ile Asp

GTC GTG CGC GAC GAC CGC TGG GGC CGC 
Val Val Arg Asp Asp Arg Trp Gly Arg 

CCC GAA ATC GTC GTC TCC TAT GCC GGC 
Pro Glu Ile Val Val Ser Tyr Ala Gly

CGA TTG GCG CAG GAC GCG AAG GCC AAC 
Arg Leu Ala Gln Asp Ala Lys Ala Asn
GAG AAG GTG GTG GCC ACC GCC AAG
Glu Lys Val Val Ala Thr Ala Lys 

CAG GGC AAG GAC CAG GGG GTC ACC 
Gln Gly Lys Asp Gln Gly Val Thr

GTC CAT GCG CGC GGC TAC ATC CCC 
Val His Ala Arg Gly Tyr Ile Pro

ATG GCC TCC TTC AAC AGC TGG CAG
Met Ala Ser Phe Asn Ser Trp Gln

GCC TTC AAG ATG CAT GGC AGC CGC 
Ala Phe Lys Met His Gly Ser Arg 

AAG ATG GGC TTC GAC GGT TTC GTG 
Lys Met Gly Phe Asp Gly Phe Val 

GTC ACC ACC GAG AAC AGC AAC GCG 
Val Thr Thr Glu Asn Ser Asn Ala 

CCC GAG GCC ATC AAC GCT GGC ATC
CAT TTC GTC GGC GAC GGC GGC ACC GAC
His Phe Val Gly Asp Gly Gly Thr Asp 

CGG GTC ACC GAG CGC GAC CTG TTG AAC 
Arg Val Thr Glu Arg Asp Leu Leu Asn 

GCG CTC GAG GCG GGC GCG CAA ACC GTG 
Ala Leu Glu Ala Gly Ala Gln Thr Val

GAC CCG TCG CAG GGC GAG GGC GCC AAG 
Asp Pro Ser Gln Gly Glu Gly Ala Lys

TAC CTG CTC ACC GAG GCC CTC AAG CAG 
Tyr Leu Leu Thr Glu Ala Leu Lys Gln

GTG TCC GAC TGG AAC GGC ATC GGC CAG 
Val Ser Asp Trp Asn Gly Ile Gly Gln

ACG CGC AAC TGC AGC AAC AGC GAC TGC 
Thr Arg Asn Cys Ser Asn Ser Asp Cys 

GAC ATG GTG ATG GTG CCG TAC CGG GCC
Pro Glu Ala Ile Asn Ala Gly Ile Asp Met Val Met Val Pro Tyr Arg Ala

GAC TGG AAG GCC TTC ATC ACC AAC ACA ATT GCA ATT GTC CGC AAA GGC GAG 
Asp Trp Lys Ala Phe Ile Thr Asn Thr Ile Ala Ile Val Arg Lys Gly Glu

ATC GCG CAG GAG CGC ATC GAC AAC GCG GTG CGG CGC ATC CTG CGC GTC AAG 
Ile Ala Gln Glu Arg Ile Asp Asn Ala Val Arg Arg Ile Leu Arg Val Lys

TTG CGC GCC GGT CTG TTC GAC AAG CCC ACA CCC TCC GCC CGT CTG GCC TCG 
Leu Arg Ala Gly Leu Phe Asp Lys Pro Thr Pro Ser Ala Arg Leu Ala Ser 

CGC GAG GTC GGC AGC GCC GAA CAC CGG GCG CTC GCG CGT GAA GCG GTG CGC 
Arg Glu Val Gly Ser Ala Glu His Arg Ala Leu Ala Arg Glu Ala Val Arg 

AAG TCG TTG GTG CTG TTG AAG AAC AAC GGC CGG GTG CTG CCG CTG GCA CGC 
Lys Ser Leu Val Leu Leu Lys Asn Asn Gly Arg Val Leu Pro Leu Ala Arg 

AAT GCC AAG GTC CTG GTG GCC GGC AAG AGC GCC AAC AGC CTC GAG AAC CAG 
Asn Ala Lys Val Leu Val Ala Gly Lys Ser Ala Asn Ser Leu Glu Asn Gln

ACC GGC GGC TGG TCG CTC AGC TGG CAA GGC ACC GGC AAC GCC AAC GCC GAT 
Thr Gly Gly Trp Ser Leu Ser Trp Gln Gly Thr Gly Asn Ala Asn Ala Asp
TTC GGC GGC GGC ACG
Phe Gly Gly Gly Thr 

GCC GAA CTC GAC ACC 
Ala Glu Leu Asp Thr 

GCC GCG ATC GTC GTG 
Ala Ala Ile Val Val

ATC GGC CGC AGC AAG 
Ile Gly Arg Ser Lys

GCC GTG ATC GAA GGC 
Ala Val Ile Glu Gly

CTG GTC TCC GGC CGC 
Leu Val Ser Gly Arg 

GCC TTC GTG GCG GCG 
Ala Phe Val Ala Ala 

GTG CTG TTC CGT GCG 
Val Leu Phe Arg Ala
ACC GTG TGG CAG GCG ATC

Thr Val Trp Gln Ala Ile

AGC GCC GAC GGC GCC AAG 
Ser Ala Asp Gly Ala Lys 

ATC GGT GAA ACA CCG TAC 
Ile Gly Glu Thr Pro Tyr

ACG CTG GAA CTC ACC AAG 
Thr Leu Glu Leu Thr Lys 

CTG CGC GCC AAG GGC GTG 
Leu Arg Ala Lys Gly Val 

CCG CTC TAC GTC AAC AAG 
Pro Leu Tyr Val Asn Lys 

TGG CTG CCC GGC ACC GAA 
Trp Leu Pro Gly Thr Glu 

GCC GAC GGC AGC GTC GCG 
Ala Asp Gly Ser Val Ala
CAG AAG ATC GCC CCG AAT
Gln Lys Ile Ala Pro Asn

GGC AGC GAT GCC TAC GAC 
Gly Ser Asp Ala Tyr Asp 

GCC GAA GGT GTC GGA GAC 
Ala Glu Gly Val Gly Asp 

CTG CGT CCA GAA GAC CTC 
Leu Arg Pro Glu Asp Leu 

AAG AAA ATC GTC ACG CTG 
Lys Lys Ile Val Thr Leu

GAG CTG AAC CGC TCG GAC 
Glu Leu Asn Arg Ser Asp 

GGC GAC GGC GTC GCC GAC 
Gly Asp Gly Val Ala Asp 

CAT GGC TTC AGC GGC AAG 
His Gly Phe Ser Gly Lys
CTG TCG TTC TCG TGG CCG AAG TCG GCC TGC CAG ACG CCG CTC AAC CGT GGC
Leu Ser Phe Ser Trp Pro Lys Ser Ala Cys Gln Thr Pro Leu Asn Arg Gly

GAC GCC ACC TAC GAC CCG CTC TAC GCT TAT GGC TAC GGC CTT CAR TAC GGC 
Asp Ala Thr Tyr Asp Pro Leu Tyr Ala Tyr Gly Tyr Gly Leu Gln Tyr Gly

GAG GAG ACC GAT CAG AGC GCG TAC GAC GAA AGC AGT GCC ACG GTC GGC TGC 
Glu Glu Thr Asp Gln Ser Ala Tyr Asp Glu Ser Ser Ala Thr Val Gly Cys

GGC ATC CAG GAC GGC GGC GGC ACC ACG GCC GAG CCG CTG GCG GTG TTC GAA 
Gly Ile Gln Asp Gly Gly Gly Thr Thr Ala Glu Pro Leu Ala Val Phe Glu

GGC GGA GCC AAC CAG GGC AAC TGG AAG CTG CGC ATC GGC GCC GAG TCG AGC 
Gly Gly Ala Asn Gln Gly Asn Trp Lys Leu Arg Ile Gly Ala Glu Ser Ser

TGG AGC AAC GAT GTG ACG CTG GCC AGC AGC GCG GTG ACG TCG ACG CCG TCC 
Trp Ser Asn Asp Val Thr Leu Ala Ser Ser Ala Val Thr Ser Thr Pro Ser 

AAC GAA CTG CAG GCC GTG CCG GTG GAC GAC AAG GCC GGG CGG CAA TGG GCG 
Asn Glu Leu Gln Ala Val Pro Val Asp Asp Lys Ala Gly Arg Gln Trp Ala

GCG GTG AAG GCG ACC TGG AAC GAC AAG CCC GGC CAG CTC TAC ATG CAA AGC
Ala Val Lys Ala Thr Trp Asn Asp Lys Pro Gly Gln Leu Tyr Met Gln Ser

GCC AAC CCC GGC GAC CTG GTG GAC CTG ATG GCC TAT CAG AAC TCC GGT GGC 
Ala Asn Pro Gly Asp Leu Val Asp Leu Met Ala Tyr Gln Asn Ser Gly Gly

GCG CTG GTG TTC GAC CTG CGT GTC GTC AGT GCG CCG ACC GAC CCG GTC AAG 
Ala Leu Val Phe Asp Leu Arg Val Val Ser Ala Pro Thr Asp Pro Val Lys 

CTG CGC GTC GAT TGC GGC TGG CCC TGT CTG GGC GAG ATC GAC GTC ACC AGC 
Leu Arg Val Asp Cys Gly Trp Pro Cys Leu Gly Glu Ile Asp Val Thr Ser

GCC GTC AAG GCC CAG CCG GTC AAC GCC TGG AAG GAA GTG GCG GTG TCG CTG 
Ala Val Lys Ala Gln Pro Val Asn Ala Trp Lys Glu Val Ala Val Ser Leu

CAG TGT TTC GCC GAC GCC GGC ACC GAC CTG GCC ATC GTC AAC ACG CCC TTC 
Gln Cys Phe Ala Asp Ala Gly Thr Asp Leu Ala Ile Val Asn Thr Pro Phe

CTG ATG TAC ACG TCT GGC CGC TTC GAA GCT GCC GTC GCG AAC ATC CGT TGG 
Leu Met Tyr Thr Ser Gly Arg Phe Glu Ala Ala Val Ala Asn Ile Arg Trp

GAG CCC AAG CGC ACG CCC AAC GTG GGG TGC AAC GGC GCA CCG ATC GCC GCC 
Glu Pro Lys Arg Thr Pro Asn Val Gly Cys Asn Gly Ala Pro Ile Ala Ala
GCG CCT TGA 2711
Ala Pro END
Pyrococcus furiosus (Clone # 7EG1) Glycosidase

1 

ATG AGC AAG AAA AAG TTC GTC ATC GTA TCT ATC TTA ACA ATC CTT TTA GTA 
Met Ser Lys Lys Lys Phe Val Ile Val Ser Ile Leu Thr Ile Leu Leu Val

CAG GCA ATA TAT TTT GTA GAA AAG TAT CAT ACC TCT GAG GAC AAG TCA ACT 
Gln Ala Ile Tyr Phe Val Glu Lys Tyr His Thr Ser Glu Asp Lys Ser Thr

TCA AAT ACC TCA TCT ACA CCA CCC CAA ACA ACA CTT TCC ACT ACC AAG GTT 
Ser Asn Thr Ser Ser Thr Pro Pro Gln Thr Thr Leu Ser Thr Thr Lys Val

CTC AAG ATT AGA TAC CCT GAT GAC GGT GAG TGG CCA GGA GCT CCT ATT GAT 
Leu Lys Ile Arg Tyr Pro Asp Asp Gly Glu Trp Pro Gly Ala Pro Ile Asp

AAG GAT GGT GAT GGG AAC CCA GAA TTC TAC ATT GAA ATA AAC CTA TGG AAC 
Lys Asp Gly Asp Gly Asn Pro Glu Phe Tyr Ile Glu Ile Asn Leu Trp Asn

ATT CTT AAT GCT ACT GGA TTT GCT GAG ATG ACG TAC AAT TTA ACC AGC GGC 
Ile Leu Asn Ala Thr Gly Phe Ala Glu Met Thr Tyr Asn Leu Thr Ser Gly



GTC CTT CAC TAC GTC CAA CAA CTT GAC AAC ATT GTC TTG AGG GAT AGA AGT 
Val Leu His Tyr Val Gln Gln Leu Asp Asn Ile Val Leu Arg Asp Arg Ser
AAT TGG GTG CAT GGA TAC CCC GAA
Asn Trp Val His Gly Tyr Pro Glu 

GCA AAC TAC GCA ACT GAT GGC CCA 
Ala Asn Tyr Ala Thr Asp Gly Pro 

CTA ACA GAC TTC TAT CTA ACA ATC 
Leu Thr Asp Phe Tyr Leu Thr Ile

CTG CCA ATT AAC TTC GCA ATA GAA 
Leu Pro Ile Asn Phe Ala Ile Glu

ACA ACA GGA ATT AAC AGC GAT GAG 
Thr Thr Gly Ile Asn Ser Asp Glu

GAC GGA TTA CAA CCG GCT GGC TCC 
Asp Gly Leu Gln Pro Ala Gly Ser

ATA GTT AAC GGA ACA CCA GTA AAT 
Ile Val Asn Gly Thr Pro Val Asn

ATT GGT TGG GAG TAT GTT GCA TTT 
Ile Gly Trp Glu Tyr Val Ala Phe
ATA TTC TAT GGA AAC AAG CCA TGG AAT
Ile Phe Tyr Gly Asn Lys Pro Trp Asn

ATA CCA TTA CCC AGT AAA GTT TCA AAC 
Ile Pro Leu Pro Ser Lys Val Ser Asn

TCC TAT AAA CTT GAG CCC AAG AAC GGC 
Ser Tyr Lys Leu Glu Pro Lys Asn Gly 

TCC TGG TTA ACG AGA GAA GCT TGG AGA 
Ser Trp Leu Thr Arg Glu Ala Trp Arg 

CAA GAA GTA ATG ATA TGG ATT TAC TAT 
Gin Glu Val Met lis Trp He Tyr Tyr 

AAA GTT AAG GAG ATT GTA GTC CCA ATA 
Lys Val Lys Glu He Val Val Pro He 

GCT ACA TTT GAA GTA TGG AAG GCA AAC 
Ala Thr Phe Glu Val Trp Lys Ala Asn 

AGA ATA AAG ACC CCA ATC AAA GAG GGA 
Arg He Lys Thr Pro He Lys Glu Gly 
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ACA GTG ACA ATT CCA 
Thr Val Thr He Pro 

AGC TTA CCA AAT TAC 
Ser Leu Pro Asn Tyr 

GAG TTT GGA ACG CCA 
Glu Phe Gly Thr Pro 

AAC ATA ACA CTA ACT 
Asn He Thr Leu Thr 
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TAC GGA GCA TTT ATA AGT 
Tyr Gly Ala Phe He Ser 

ACA GAA CTT TAC TTA GAG 
Thr Glu Leu Tyr Leu Glu 

AGC ACT ACC TCC GCC CAC 
Ser Thr Thr Ser Ala His 

CCT CTA GAT AGA CCT CTT 
Pro Leu Asp Arg Pro Leu 
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GTT GCA GCC AAC ATT TCA 
val Ala Ala Asn He Ser 

GAC GTG GAG ATT GGA ACT 
Asp Val Glu He Gly Thr 

CTA GAG TGG TGG ATC ACA 
Leu Glu Trp Trp He Thr 

ATT TCC TAA 960 
He Ser End 
Vibrio harveyi (Clone # 91GP2) Glycosidase

1 

ATG AGA GGT AAC ACG ATG AAG CAA AAA GCG CTA TAT CGA GCA GTA GCA ATG 
Met Arg Gly Asn Thr Met Lys Gln Lys Ala Leu Tyr Arg Ala Val Ala Met

GGT TTG AGT GGT CTT GCG AAC GTC GCA TCC GCT AAT GAG ATG GTA AAT CCT 
Gly Leu Ser Gly Leu Ala Asn Val Ala Ser Ala Asn Glu Met Val Asn Pro 

GAT GGT GGT GTC GTA GTG GGT TAC TGG CAT AAC TGG TGC GAT GGC GCT GGT 
Asp Gly Gly Val Val Val Gly Tyr Trp His Asn Trp Cys Asp Gly Ala Gly 

TAC AAG GGA GGT AAT GCA CCG TGT GTA ACA TTG GAT GAA GTT GAT CCT ATG 
Tyr Lys Gly Gly Asn Ala Pro Cys Val Thr Leu Asp Glu Val Asp Pro Met 

TAC AAT GTG GTT AAC GTC TCC TTT ATG AAG GTA TTC AAT ACC AGT GAA GGT 
Tyr Asn Val Val Asn Val Ser Phe Met Lys Val Phe Asn Thr Ser Glu Gly 

CGT ATT CCA ACC TTT AAG CTC GAT CCA AAT ATC GGC CTT TCA GAA CAA CAA 
Arg Ile Pro Thr Phe Lys Leu Asp Pro Asn Ile Gly Leu Ser Glu Gln Gln

TTT TTT GAC CAA ATT GAA GCT CTA AAC CAA CAA GGA CGT GCC GTT CTC ATC 
Phe Phe Asp Gln Ile Glu Ala Leu Asn Gln Gln Gly Arg Ala Val Leu Ile

GCT CTT GGT GGC GCA GAT GCT CAC GTT GAA CTT AGA ACT GGT GAC GAA CAA
Ala Leu Gly Gly Ala Asp Ala His Val Glu Leu Arg Thr Gly Asp Glu Gln

GCG TTC GCA CAA GAG ATT ATT CGT TTA ACG GAT AAG TTC GGT TTT GAT GGT
Ala Phe Ala Gln Glu Ile Ile Arg Leu Thr Asp Lys Phe Gly Phe Asp Gly

CTA GAT ATC GAT TTA GAG CAG TCA GCA GTA ACG GCA GAG AAC AAC CAA ACC
Leu Asp Ile Asp Leu Glu Gln Ser Ala Val Thr Ala Glu Asn Asn Gln Thr

GTA ATT CCA GCT GCA CTT CGC CTT GTA AAA GAG CAT TAT CAA CAA CAA GGT
Val Ile Pro Ala Ala Leu Arg Leu Val Lys Glu His Tyr Gln Gln Gln Gly

AAG AAC TTC CTA ATT ACG ATG GCG CCT GAA TTC CCT TAT CTA ACA GAA GGT
Lys Asn Phe Leu Ile Thr Met Ala Pro Glu Phe Pro Tyr Leu Thr Glu Gly

GGC AAG TAT GTT CCT TAC ATT ACT GGT TTA GAA GGG TAC TAC GAT TGG ATC
Gly Lys Tyr Val Pro Tyr Ile Thr Gly Leu Glu Gly Tyr Tyr Asp Trp Ile

AAC CCT CAG TTT TAC AAT CAA GGT GGT GAC GGT ATT TGG GTT GAT GGC GTG
Asn Pro Gln Phe Tyr Asn Gln Gly Gly Asp Gly Ile Trp Val Asp Gly Val

GGT TGG ATA GCG CAA AAC AAT GAT GAG TTA AAA CAA GAG TTT ATT TAC TAC
Gly Trp Ile Ala Gln Asn Asn Asp Glu Leu Lys Gln Glu Phe Ile Tyr Tyr

ATT TCG GAC GCT CTA TCG AAC GGT ACA CGC GGT TTC CAC AAA ATC CCG CAT
Ile Ser Asp Ala Leu Ser Asn Gly Thr Arg Gly Phe His Lys Ile Pro His

GAC AAA CTG GTG TTT GGT ATC CCA TCT AAC ATT GAT GCT GCT GCA ACG GGC 
Asp Lys Leu Val Phe Gly Ile Pro Ser Asn Ile Asp Ala Ala Ala Thr Gly

TTT GTT CAA AAC CCT CAA GAC CTT TAC GAC GCG TTT GAT CAA CTT AAA GCG 
Phe Val Gln Asn Pro Gln Asp Leu Tyr Asp Ala Phe Asp Gln Leu Lys Ala

CAA GGG CAG GCA CTT CGT GGC GTA ATG ACA TGG TCG GTG AAC TGG GAT ATG 
Gln Gly Gln Ala Leu Arg Gly Val Met Thr Trp Ser Val Asn Trp Asp Met

GGC ACC GAT AAA AAT GGC CAA GCG TAC GGT GAA AAA TTC GTG AAG GAT TAC 
Gly Thr Asp Lys Asn Gly Gln Ala Tyr Gly Glu Lys Phe Val Lys Asp Tyr

GGT CCG TTT ATC CAC GGG CAG ACT CCA CCA CCA AGT GAA GGT GAA CCA GTT 
Gly Pro Phe Ile His Gly Gln Thr Pro Pro Pro Ser Glu Gly Glu Pro Val

TTT AGT GGC CTC AAC GAT GTT CGT GTG CAT CAC GGT AGT TCA TTT GAC CCG 
Phe Ser Gly Leu Asn Asp Val Arg Val His His Gly Ser Ser Phe Asp Pro

TAT GCA GGT GTT ACT GCG TCT GAT AAA GAA GAT GGA GAC CTA ACC AAC AGC
Tyr Ala Gly Val Thr Ala Ser Asp Lys Glu Asp Gly Asp Leu Thr Asn Ser

ATC ACT GTC GAA GGT TCA GTT GAT GTG AAC ACG GTA GGC ACA TAT GTT TTG 
Ile Thr Val Glu Gly Ser Val Asp Val Asn Thr Val Gly Thr Tyr Val Leu

GTT TAC AGT GTA AAA GAC AGC GAC AAC AAT GAA ACC AAG CAA AGT AGA ACG 
Val Tyr Ser Val Lys Asp Ser Asp Asn Asn Glu Thr Lys Gln Ser Arg Thr

GTT GTT GTT TAC AGC CTA GTG CCT GAG TTT GAA GGT GTC GCA GAT ACG ACC 
Val Val Val Tyr Ser Leu Val Pro Glu Phe Glu Gly Val Ala Asp Thr Thr 

ATC CAG CTT GGT GAC GCT TTT GAC CCA ATG GCA GGC GTA AAA GCG ACC GAT 
Ile Gln Leu Gly Asp Ala Phe Asp Pro Met Ala Gly Val Lys Ala Thr Asp

GCA GAA GAC GGT GAT TTG ACT GAT CGG TAT CTA CGC CGC CTA AGG TCA CTT 
Ala Glu Asp Gly Asp Leu Thr Asp Arg Tyr Leu Arg Arg Leu Arg Ser Leu 

CTG CGG TGC GAT AGC CTT CTG TGC CAT TTG GTG CAA CCG CCC AGT TTT CCA 
Leu Arg Cys Asp Ser Leu Leu Cys His Leu Val Gln Pro Pro Ser Phe Pro

GAC GCT CAA CGA TGG TTG CCA TCT CTT TCT GGT TGA 1514
Asp Ala Gln Arg Trp Leu Pro Ser Leu Ser Gly END